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Okechukwu  Ugweje 
Electrical  Engineering  Department 
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Abstract 


A  series  of  experiments  were  conducted  via  the  ACTS  DS-3  link  carrying  ATM  traffic  in  a  PLCP  frame  format. 
These  experiments  were  performed  between  Rome  Laboratory,  Griffiss  Air  Force  Base,  New  York  and  Communi¬ 
cation  Research  Centre,  Nepean,  Canada.  These  tests  were  conducted  to  test  the  viability  of  ATM  DS-3  bearers 
via  the  satellite  at  a  performance  level  comparable  to  that  of  terrestrial  fiber  optic  link  or  microwave  link.  In 
these  tests,  both  single  and  multiple  channels  configurations  of  ATM  satellite  bearers  were  tested.  The  single 
channel  experiments  were  configure  to  fully  load  the  channel  at  96,000  Cells  per  second,  while  the  multiple  chan¬ 
nel  experiments  were  used  to  assess  the  ability  of  ATM  protocol  to  carry  information  from  different  sources  at  dif¬ 
ferent  rates.  Our  main  objective  of  characterizing  the  DS-3  satellite  channel  for  the  transmission  of  ATM  signals 
was  achieved,  and  the  result  obtained  were  presented  and  analyzed. 
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ATM  DS-3  EXPERIMENTS  VIA  THE  ADVANCED  COMMUNICATION 
TECHNOLOGY  SATELLITE 


Valentine  Aalo 
Okechukwu  Ugweje 
Mostafa  Chinichian 


1.0  INTRODUCTION 

The  popularity  of  Asynchronous  Transfer  Mode  (ATM)  has  risen  to  a  new  height,  and  all  efforts  are  being  made 
to  experimentally  verify  its  potential  and  usefulness.  It  is  indicated  in  [1],  that  in  the  next  decade,  most  telecom¬ 
munication  traffic  will  be  carried  by  ATM  technology.  ATM  is  now  regarded  as  the  communication  protocol  of 
the  future  and  will  play  a  significant  role  in  the  “information  superhighway”.  The  usefulness  and  adaptability  of 
ATM  to  current  technology  is  also  the  subject  of  lots  of  research  in  both  Government  and  industry.  ATM  tech¬ 
nology  and  its  applications  are  still  being  developed  and  tested.  Global  deployment  of  ATM  technology  may  take 
some  time  as  most  economical  and  effective  means  of  implementing  this  evolving  technology  is  still  being  stud¬ 
ied. 

Although  ATM  was  originally  intended  for  fiber  optic  links  transmitting  high  speed  data,  it  can  be  used  for  other 
links  such  as  the  satellite  link,  which  is  the  subject  of  this  experiment.  Satellites  will  play  a  vital  role  in  the  provi¬ 
sion  of  ATM-based  sendees  such  as  voice,  data,  video  and  imaging,  on  demand,  at  very  high  data  rates  and  at  any 
location.  Many  performance  issues  regarding  the  transfer  of  ATM  information  bearers  via  the  satellite  channel 
remain  unresolved,  especially  at  Ka-band  where  the  performance  of  satellite  communication  systems  are  severely 
impaired  by  atmospheric  propagation  effects,  especially  rain  attenuation.  We  are  interested  in  studying  by  means 
of  actual  experiment  those  characteristics  of  ATM  protocol  that  is  suitable  and  reliable  for  the  transmission  of  in¬ 
formation  via  satellite  Ka-band  at  DS-3  rates. 

The  aim  of  this  test  was  to  demonstrate  the  practicality  of  transmitting  and  receiving  ATM  Cells  using  the  Ad¬ 
vanced  Communication  Technology  Satellite  (ACTS).  This  report  documents,  in  a  summarized  form,  the  experi¬ 
ment  of  transmitting  ATM  signal  via  the  ACTS  conducted  in  the  Summer  of  1995  at  Rome  Laboratory  (RL), 
Griffiss  Air  Force  Base  (AFB),  New  York.  These  experiments  are  part  of  the  global  grid  Technical  Technology 
Cooperation  Program  (TTCP)  network,  in  which  RL  will  communicate  via  the  satellite  with  the  Canadians  at  the 
Communication  Research  Center  (CRC)  [2],  These  experiments  were  conducted  via  the  satellite  between  the 
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CRC,  Nepean.  Canada  and  RL,  Griffiss  Air  Force  Base  (GAFB),  New  York.  This  experiment  is  part  of  the  on¬ 
going  effort  to  link  the  assets  of  the  Air  Force  at  RL.  the  Army  at  the  Communication  Electronic  Command 
(CECOM),  and  the  Navy  at  the  Naval  Research  and  Development  (NRaD)  [3],  This  report  is  based  on  the  seg¬ 
ment  of  the  experiments  that  were  conducted  from  27  June  1995  through  24  August  1995.  Several  tests  were  per¬ 
formed  via  a  satellite  DS-3  link  carrying  ATM  traffic  in  a  Physical  Layer  Convergence  Protocol  (PLCP)  frame 
format.  In  these  tests,  both  single  and  multiple  channels  configuration  of  ATM  satellite  bearers  were  tested.  The 
single  channel  configuration  fully  loads  the  channel  at  96,000  Cells/s,  while  the  multiple  channel  configuration 
was  used  to  assess  the  ability  of  ATM  protocol  to  carry  information  from  different  sources  at  different  transmis¬ 
sion  rates. 

2  .0  SETUP  AND  CONFIGURATIONS 

The  first  stage  of  this  experiment  involved  the  characterization  of  the  satellite  channel  for  reliable  transmission  of 
ATM  signals  at  DS-3  rates  via  the  ACTS.  In  the  second  stage,  ATM  Cells  bearers  were  transmitted  and  received 
via  the  ACTS  satellite  at  DS-3  rates.  In  the  following  sections  the  resources  and  experimental  procedure  used  in 
these  tests,  particularly  at  the  RL  test  site,  are  briefly  described. 

2. 1  Test  Sites  and  Facilities 

As  indicated  earlier,  as  part  of  the  global  grid  network,  RL,  New  York,  will  communicate  via  the  satellite  with 
CRC,  Canada.  The  basic  configuration  used  to  connect  CRC  and  RL  via  the  ACTS  is  shown  in  Figure  1.  This 
configuration  simply  shows  the  terminal  equipments  and  the  ATM  equipments  in  block  diagram  form.  Details 
are  not  shown.  This  configuration  includes  the  ATM  generator/analyzer  equipment  and  a  PC  workstation  fitted 
with  ATM  adapter  card.  This  setup  is  briefly  described  below. 

2.1.1  Rome  Laboratory,  Griffiss  AFB,  New  York 

At  RL,  the  satellite  transmission  facilities  is  located  outside  in  two  huts  south  of  building  3.  The  high  data  rate 
antenna  is  1.8m  with  gain  of  60.40  dBi  for  Uplink  and  48.53  dBi  for  Downlink.  The  antenna  is  positioned  at 
Latitude:  43°  13\  13.18”  N 

Longitude:  75°  24’,  34.00”  W 

Azimuth  214.1°;  Elevation  34.68°;  Polarization  Tilt  24.43° 

In  addition  to  the  1.8m  high  data  rate  antenna,  other  terminal  equipments  at  the  RL  test  site  consist  of  the  EF 
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ACTS 


Rome  Laboratory  Test  Site 
Griffiss  Air  Force  Base,  New  York 


CRC  Test  Site 
Nepean,  Ottawa,  Canada 


Figure  1:  ATM  DS-3  Test  Configuration 


Data  modem  (model  SDM-9000),  Ka-band  traveling-wave-tube  amplifier  (TWTA),  test  loop  translator  (TLT), 
converters  and  amplifiers.  The  EF  Data  modem  is  a  high  performance,  full-duplex,  digital-vector  modem  with 
data  rate  capabilities  of  6  Mbps  to  51.84  Mbps.  The  modem  has  built-in  scrambler/descrambler,  differential  en¬ 
coding/decoding,  multi-rate  forward  error  correction  capabilities  (convolutional  encoder  and  Viterbi  decoder)  and 
can  be  configured  to  add  overhead/framing  to  the  data.  For  example,  a  Reed-Solomon  (RS)  encoding  and  decod¬ 
ing  is  provided  to  work  with  the  built-in  Viterbi  decoder,  in  conjunction  with  additional  framing  and  interleav¬ 
ing,  resulting  in  improved  overall  performance. 

At  the  RL  test  site,  the  ATM  generator/analyzer,  AX/4000,  manufacture  by  AdTech,  Inc.  is  used  as  the  ATM 
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test-bed  equipment.  This  equipment  provides  complete  signally,  switching,  generation  and  analysis  of  ATM  traf¬ 
fic.  It  is  modular  in  nature,  allowing  for  custom  configuration  for  a  variety  of  ATM  test  applications.  This  unit 
consists  of  the  mainframe,  the  ATM  generation/analyzer  modules  and  is  connected  to  a  window-based  worksta¬ 
tion.  While  a  variety  of  port  interfaces  are  possible  (for  example,  SONET  OC-3c,  E3,  TAXI,  and  DS-3),  the  in¬ 
terface  provided  with  the  AX/4000  is  the  DS-3  interface  using  electrical  format  and  BNC  connection  to  the  EF 
Data  modem,  and  IEEE  804.2  interface  to  the  PC  workstation.  Using  the  associated  AX/4000  Microsoft  window- 
based  software,  one  can  select  different  configurations  of  single  or  multiple  ports  (each  port  has  up  to  16  channels 
or  substreams)  with  capabilities  of  up  to  140  Mbps.  However,  in  this  experiment,  we  are  only  interested  in  the 
DS-3  speed  of  45  Mbps,  and  the  AX/4000  is  capable  of  generating  a  lull  DS-3  Cell  stream. 

The  AX/4000  ATM  equipment  is  installed  in  the  adjacent  hut,  about  25  feet  from  the  first  hut  as  shown  in  Figure 
1.  The  Fireberd  Communication  Analyzer  (model  MC6000),  HP  Spectrum  Analyzers  (HP  8568A  &  HP  8563), 
and  the  EF  Data  modem  are  co-located  with  the  ATM  equipment.  Note  that  all  connections  at  the  RL  test  site 
uses  electrical  format. 

2.1.2  Communication  Research  Centre,  Nepean,  Canada 

The  ATM  test-bed  at  the  CRC  test  site  is  located  in  the  Broadband  Application  and  Demonstration  Lab 
(BADLAB)  in  building  2D  [4],  The  satellite  communication  (satcom)  facility  is  located  in  building  46.  The  sat- 
com  facility  and  the  ATM  facility  are  linked  by  fiber  optic  line,  over  off-net  extension  via  the  satellite  ATM  links. 

The  antenna  at  CRC  test  site  is  4.2m  with  gain  of  52.34  dBi  for  Uplink  and  56.59  dBi  for  Downlink.  The  posi¬ 
tion  of  the  antenna  is  at 

Latitude:  45°  2 1’,  9.95”  N 

Longitude:  75°  54’,  9.59”  W 

Azimuth  220.85°;  Elevation  29.16°;  Polarization  Tilt  48° 

The  terminal  facilities  (Modem,  TLT,  TWTA)  at  CRC  test  site  is  the  same  as  RL  test  site.  There  may  be  minor 
variations  in  terms  of  the  location  and  connectivity  of  the  equipments  but  the  functionality  of  all  the  equipments 
at  the  two  test  sites  remain  the  same.  It  is  perhaps  important  to  note  that  most  of  the  terminal  equipments  used  in 
this  experiment  at  both  test  sites  were  provided  by  Canada’s  CRC  in  corporation  with  Air  Force’s  Rome  Labora¬ 
tory.  Their  contribution  include  the  frequency  plan,  the  redesign  of  the  converters  and  amplifiers  to  be 
compatible  with  ACTS  Ka-band  frequencies  at  DS-3  rates,  and  the  actual  test  procedure  used  in  these  experi¬ 
ments. 
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2.2  Satellite  Facilities 


The  Advanced  Communication  Technology  Satellite  (ACTS)  was  used  for  this  experiment.  Access  to  the  ACTS 
was  provided  by  NASA  Lewis,  Cleveland.  Ohio.  The  ACTS  operations  center  at  NASA  Lewis  provided  all  ac¬ 
cess  for  the  East  Scan  4E  spot  beam  used  by  RL  test  site  as  well  as  the  Steerable  spot  beam  used  by  CRC. 

During  these  tests.  RL  transmitted  horizontally  with  polarization  tilt  of  24.4°  clockwise  into  the  East  Scan  4E 
beam  at  29. 125  GHz  and  received  in  the  orthogonal  plane  at  19.505  GHz.  On  the  other  hand,  CRC,  transmitted 
vertically  with  polarization  tilt  of  22.2°  clockwise  into  the  steerable  antenna  at  29.225  GHz  and  received  orthogo¬ 
nally  at  19.405  GHz  [51.  Because  of  the  enormous  loss  in  orthogonal  polarization  as  seen  from  the  satellite  EIRP 
and  G/T  coverage,  self  loopbacks  are  not  practical.  It  should  be  pointed  out,  however,  that  the  ACTS  operation 
center  at  NASA  Lewis  can  provide  satellite  loopbacks  for  both  the  steerable  and  East  Scan  4E  spot  beams.  This 
implies  that  NASA  Lewis  can  serve  as  a  loopback  node  for  either  RL  test  site  or  CRC  test  site. 

3.0  EXPERIMENTAL  PROCEDURE 

3.1  Channel  Characterization 

The  sequence  of  tests  involving  the  transmission  of  ATM  Cells  bearers  via  the  ACTS  started  on  27  June  1995. 
The  first  set  of  tests  were  aimed  at  characterizing  the  channel  between  RL,  and  CRC,  and  to  familiarize  RL  per¬ 
sonnel  with  the  ground  terminal  equipment.  The  objective  of  the  channel  characterization  experiments  is  to  eval¬ 
uate  the  performance  of  the  EF  Data  modem  with  the  Fireberd  tester  at  DS-3  rates  over  the  ACTS.  For  the  chan¬ 
nel  characterization  experiments,  the  setup  is  as  shown  in  Figure  1  without  the  AdTech  AX/4000  equipment  con¬ 
nected. 

Two  types  of  modulation  schemes,  namely,  the  QPSK  and  8PSK  were  emphasized  in  this  experiment.  The  char¬ 
acterization  tests  as  well  as  the  ATM  transmission  tests  were  conducted  using  both  QPSK  and  8PSK  with  and 
without  RS  encoding  for  different  modem  power  levels.  During  this  period  of  characterization,  a  number  of 
transmissions  involving  continuous  wave  (CW)  at  the  two  modulation  schemes  were  performed.  This  was  done 
to  evaluate  the  EF  Data  modem  and  the  Fireberd  DS-3  test  equipment.  The  channel  characterization  and  modem 
evaluation  are  necessary  to  ensure  that  at  DS-3  rates  and  in  the  Ka-band  the  modem  specifications  for  the  differ¬ 
ent  modulations  can  be  verified.  During  the  CW  transmissions,  the  transmit  power  levels  at  both  ends  (CRC  and 
RL)  were  varied  (with  the  TX  power  meter  reading  recorded)  while  the  carrier  power  (C)  and  the  noise  power 
(N0)  are  read  from  the  Spectrum  Analyzer  at  opposite  ends. 
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The  bit  error  rate  (BER)  is  measured  using  the  Fireberd  Analyzer.  It  is  also  possible  to  measure  the  BER  using 
the  EF  Data  modem  but  we  did  not  use  the  measurement  from  the  modem.  To  evaluate  the  modem  and  the  Fire¬ 
berd  BER  tester,  the  QPSK  with  RS,  QPSK  without  RS,  8PSK  with  RS  and  8PSK  without  RS  were  used  as  the 
modulation  schemes.  QPSK  modulation  is  at  the  code  rate  of  3/4  while  8PSK  is  at  the  code  rate  of  2/3.  In  each 
case,  both  the  received  BER  and  E,/N0  were  measured  for  each  setting  of  the  transmit  power.  All  transmissions, 
by  default,  included  the  modem’s  built-in  convolutional  and  Viterbi  decoding  at  the  indicated  code  rates. 

At  each  transmit  modem  power  level,  the  transmit  power  meter  reading  were  recorded,  as  well  as  the  received  C 
and  N0  at  the  other  end.  From  these  measurements,  the  EJ N0  is  given  by  E,/N0  =  C/N0  - 10  log(44.73X106), 
where  C  and  N0  are  measured  with  the  Spectrum  Analyzer 


3.2  ATM  Transmission  Experiments 


A  series  of  experiments  were  conducted  via  the  ACTS  DS-3  link  carrying  ATM  traffic  in  a  PLCP  frame  format. 
The  main  aim  of  the  ATM  experiments  is  to  characterize  the  DS-3  satellite  link  for  the  transmission  of  ATM 
signals.  Specifically,  some  ATM  QoS  parameters  were  measured  using  different  levels  of  modulation  (QPSK, 
8PSK)  with  forward  error  correction  and  V.35  data  scrambling.  For  the  ATM  experiments,  the  setup  is  as  shown 
in  Figure  1,  with  the  AdTech  equipment  connected;  As  indicated  earlier,  the  AdTech  equipment ,  AX/4000,  is 
used  as  the  ATM  test-bed.  The  desired  configuration  is  set  by  means  of  the  window-based  software  accompany¬ 
ing  the  AX/4000.  There  are  two  types  of  transmission  configurations  (single  and  multiple  channels)  used  in 
these  tests.  In  the  single  channel  experiment,  one  port  and  one  substream  was  selected  and  configured  with  the 
following  parameters: 


Substream 


AALType: 
PRBS  Type: 
GFC  Range: 
VPI  Range: 
VCI  Range: 
PT  Range: 
CLP  Range: 
Cell  Rate: 


Test  Cells 
29  - 1 
Oh-fh 
OOh-ffh 
OOlOh 
0-3 
0-1 

96,000  cells/s 


The  single  channel  payload  data  stream  was  set  to  generate  96,000  ceils  per  second.  This  is  the  maximum  cell 
generation  capacity  of  the  AX/4000.  All  other  ports  and  substreams  (substream  2  to  16)  are  disabled. 
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For  the  multiple  channel  case,  one  port  and  six  substreams  (substream  1  to  6  )  are  selected  and  configured  as 
shown  in  Table  1.  Substreams  7  to  16  are  disabled.  Table  1  shows  that  substreams  1  &  2.  substreams  3  &  4, 


Table  1:  Multiple  Channel  Configuration 

PORT  1  SETUP 

Substream  1 

AAL  Type: 

Test  Cells 

PRBS  Type:  2’-l 

Substream  2 

AAL  Type: 

Test  Cells 

PRBS  Type:  25-l 

Substream  3 

AAL  Type: 

Test  Cells 

PRBS  Type:  2*-l 

Substream  4 

AAL  Type: 

Test  Cells 

PRBS  Type:  2’-I 

Substream  5 

AAL  Type: 

Test  Cells 

PRBS  Type:  29-l 

Substream  6 

AAL  Type: 

Test  Cells 

PRBS  Type:  29-l 

GFC  Range:  Oh-flt 

VPI  Range:  OOh-filt 
VC1  Range:  000 lit 

PT  Range:  0-7 

CLP  Range:  0  - 1 

GFC  Range:  Oli-fh 

VPI  Range:  OOh-ffli 
VCI  Range:  0002h 

PT  Range:  0-7 

CLP  Range:  0  -  1 

GFC  Range:  Oh-flt 

VPI  Range:  OOh-ffh 
VCI  Range:  0003b 

PT  Range:  0-7 

CLP  Range:  0  -  1 

GFC  Range:  Oh-flt 

VPI  Range:  OOh-ffli 
VCI  Range:  0004h 

PT  Range:  0  -  7 

CLP  Range:  0  - 1 

GFC  Range:  Oli-fh 

VPI  Range:  OOh-flh 
VCI  Range:  0005h 

PT  Range:  0-7 

CLP  Range:  0  - 1 

GFC  Range:  Oh-flt 

VPI  Range:  OOh-ffli 
VCI  Range:  0006h 

PT  Range:  0-7 

CLP  Range:  0-1 

Cell  Rate: 

150  Cells/s 

Cell  Rate: 

150  Cells/s 

Cell  Rate: 

604  Cells/s 

Cell  Rate: 

604  Cells/s 

Cell  Rate: 

3640  Cells/s 

Cell  Rate: 

3640  Cells/s 

substreams  5  &  6,  have  ATM  Cell  generation  rates  of  150  Cells/s,  604  Cells/s,  and  3640  Cells/s  respectively.  The 
combined  Cell  Rate  should  be  less  or  equal  to  96,000  Cells/s.  In  our  own  case,  for  multiple  channel  configura¬ 
tion,  the  combined  Cell  Rate  is  8788  Cells/s.  In  either  the  single  substream  or  multiple  substreams  configuration, 
the  PRBS  errors  are  triggered  by  the  Cell  Sequence  Error,  Cell  Payload  Bit  Error  or  the  Loss  of  Payload  Pattern 
Synchronization. 

Using  the  AdTech  AX/4000,  it  is  possible  to  measure  a  lot  of  information  for  each  test  transmission.  These  are 
known  as  stream  and  substream  statistics.  A  sample  of  the  measured  statistics  using  the  AX/4000  analyzer  mod¬ 
ule  is  shown  in  Table  2.  This  table  shows  a  sample  of  the  statistics  for  QPSK  experiment  conducted  on  24  My 
1995,  and  the  8PSK  experiment  conducted  on  11  August  1995.  Both  experiments  are  for  multiple  channel  (or 
substream)  tests,  with  only  the  first  substream  shown.  Similar  statistics  was  obtained  for  the  single  channel  tests. 


4.0  ANALYSIS  OF  RESULT 

The  key  issues  regarding  the  reliable  transmission  of  ATM  signal  via  satellite  is  the  subject  of  this  test,  especially, 
the  error  characteristic  and  propagation  delay.  This  is  because  in  satellite  transmission  at  the  Ka-band,  the  satel¬ 
lite  link  is  susceptible  to  atmospheric  effects,  the  most  important  of  which  is  attenuation  due  to  rain.  To  this  ef¬ 
fect,  the  weather  condition  at  RL  during  each  period  of  an  experiment  is  recorded  and  is  enclosed  in  the  ap¬ 
pendix.  The  reason  for  this  is  to  observe  any  relationship  between  the  collected  data  and  the  weather  condition. 
We  observed  that  on  several  conditions,  the  satellite  link  between  RL  and  CRC  was  lost  whenever  there  is  a  rain- 
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Table  2:  ATM  Transmission  Stream  and  Substream  Statistics 

l - 

QPSK  Modulation 

8PSK  Modulation 

Stream  Statistics: 

Stream  Statistics: 

Aggregate  Count:  48343 

Aggregate  Count:  17114 

Uncorrected  Header  Count:  915 

Uncorrected  Header  Count:  3 

Aggregate  Cell  Rate:  70,113  cells/s 

Aggregate  Cell  Rate:  22,240  cells/s 

Aggregate  Cell  Transfer  Capacity:  29.728  Mb/s 

Aggregate  Cell  Transfer  Capacity:  9.430  Mb/s 

Bandwidth  Percentage:  73.03  % 

Bandwidth  Percentage:  23.17  % 

Substream  Statistics: 

Substream  Statistics: 

Cell  Count:  177695  cells 

Cell  Count:  556180  cells 

Cell  Rate:  149.8  cells/s 

Cell  Rate:  150.0  cells/s 

Cell  Transfer  Capacity:  0.064  cells/s 

Cell  Transfer  Capacity:  0.064  cells/s 

CLP=1  Cell  Count:  41  cells 

CLP=1  Cell  Count:  0  cells 

CLP=1  Error  Ratio:  2.31E-04 

CLP=1  Error  Ratio:  0.00E+00 

Cell  Loss  Count:  124  cells 

Cell  Loss  Count:  9  cells 

Cell  Loss  Ratio:  0.000697495 

Cell  Loss  Ratio:  1.61815e-05 

Misinsertion  Cell  Count:  40  cells 

Misinsertion  Cell  Count:  0  cells 

Cell  Misinsertion  Rate:  0.0337286  cells/s 

Cell  Misinsertion  Rate:  0  cells/s 

Out-of-Sequence  Count:  0  events 

Out-of-Sequence  Count:  0  events 

Errored  Cell  Count:  85  cells 

Errored  Cell  Count:  93  cells 

Cell  Error  Ratio:  0.000478348 

Cell  Error  Ratio:  0.000167212 

PRBS  Bit  Error  Count:  895  bits 

PRBS  Bit  Error  Count:  215  bits 

PRBS  Bit  Error  Rate:  1.65682e-05  errors/bit 

PRBS  Bit  Error  Rate:  1.2716e-06  errors/bit 

PRBS  Sync  Error  Count:  0  resyncs 

PRBS  Sync  Error  Count:  0  resyncs 

storm  in  RL,  CRC  or  in  between.  This  observation  is  consistent  with  theoretical  analysis  which  has  shown  that 
rain  attenuation  severely  affects  transmission  at  Ka-bands. 

To  properly  assess  the  viability  of  ATM  via  satellite,  the  cumulative  effect  of  the  satellite  link  characteristics  on 
the  ATM  parameters  must  be  determined.  Performance  consideration  for  ATM  are  currently  based  on  the  as¬ 
sumption  that  transmission  bit  errors  are  randomly  distributed.  This  may  be  true,  to  a  high  degree  of  accuracy  for 
most  terrestrial  microwave  and  fiber  optic  based  transmission  systems.  However,  the  validity  of  this  assumption 
has  not  been  verified  for  satellite  ATM  Cell  bearers.  The  bursty  nature  of  transmission  errors  via  the  satellite  re¬ 
quires  careful  evaluation. 

4. 1  Channel  Characterization 

For  satellite  transmission  channels,  ATM  performance  parameters  can  only  be  quantified  if  the  nature  of  the 
transmission  bit  errors  is  first  characterized.  The  characterization  phase  of  this  experiment  was  aimed  at  achiev- 
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ing  this  objective. 


The  entire  raw  data  collected  during  this  experiment  can  be  obtained  from  the  satcom  division,  RL/C3BA,  Rome 
Laboratory,  New  York.  Some  of  these  data  is  presented  here  for  the  purposes  of  discussion. 

Most  of  the  data  presented  here  are  collected  from  the  RL  test  site.  The  other  half  collected  from  CRC  can  also  be 
obtained  from  RL/C3BA  or  from  CRC.  For  brevity,  this  other  half  is  not  presented  here. 

The  graph  of  Eb/N0  versus  BER  for  the  two  modulation  schemes,  with  or  without  RS  encoding  is  shown  in  Fig¬ 
ures  2  and  3.  In  Figure  4  and  5,  the  performance  of  QPSK  and  8PSK  are  compared.  The  effect  of  using  RS  en¬ 
coding  is  shown  in  Figure  6,  while  the  characteristics  of  both  the  forward  (RL  to  CRC)  and  backward  (CRC  to 
RL)  links  are  shown  in  Figure  7.  The  actual  experiment  represented  by  each  graph  is  also  indicated  on  the  plot. 
The  following  observation  can  be  made  from  these  plots. 

•  QPSK  provides  better  performance  than  8PSK.  This  result  could  be  attributed  to  the  fact  that  the  EF 
Data  modem  is  more  stable  with  respect  to  QPSK  than  the  8PSK.  In  other  words,  it  is  much  simpler  for 
the  EF  Data  modem  to  distinguish  between  four  phases  as  opposed  to  eight  phases.  Thus,  signal  detec¬ 
tion  is  more  reliable  at  QPSK  modulation  than  at  8PSK  modulation. 

•  RS  encoding/decoding  provides  considerable  system  performance  improvement.  For  example  in 
QPSK  at  BER  =  10"6,  there  is  a  power  saving  of  as  much  as  1.5  dB  with  the  use  of  RS  coding.  The  RS 
coding  gain  increases  for  lower  BER  values.  This  is  consistent  with  expected  analytical  values. 

•  From  Figure  7,  it  is  observed  that  RL  to  CRC  link  and  the  reverse  link  have  identical  transmission  link 
characteristics.  The  noticeable  variation  could  be  attributed  to  different  antenna  gain  at  both  test  sites, 
and  to  the  different  ACTS  transponders  (or  spot  beams)  used  for  both  test  sites. 

4.2  ATM  Transmission 

The  bursty  nature  of  the  error  statistics  and  the  corresponding  large  propagation  delay  involved  in  a  satellite  link 
affect  ATM  transmission  performance  [6],  A  number  of  ATM  parameters  have  been  identified  as  being  very  im¬ 
portant  in  the  assessment  of  the  performance  of  an  ATM  network.  Some  of  the  most  important  QoS  parameters 
include  the  transmission  channel  bit  error  ratio  (BER),  cell  loss  ratio  (CLR),  errored  cell  count  (ECC)  and  the 
PRBS  parameters.  In  the  experiments,  emphasis  was  placed  on  those  parameters  and  statistics  that  can  be  mea¬ 
sured  by  the  AdTech  ATM  test  equipment.  Some  of  these  parameter  is  discussed  bellow. 

•  Bit  Error  Rate  (BER):  This  is  the  rate  at  which  the  transmitted  bits  were  changed  in  the  ATM  physical 
layer. 
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•  Cell  Loss  Ratio  (CLR):  The  ratio  of  the  number  of  lost  ATM  cells  sent  by  a  user  in  specified  time  inter¬ 
val.  Due  to  the  random  nature  of  the  ATM  Cells,  the  limited  size  of  the  ATM  traffic  and  the  limited 
size  of  the  buffers,  it  is  usually  possible  that  a  cell  arriving  at  a  switching  node  may  be  lost .  Thus  CLR 
are  caused  by  buffer  overflows  and  bit  error  in  the  cell  header  that  can  be  detected  but  not  be  corrected. 

•  Cell  Misinsertion  Ratio  (CMR):  Defined  as  the  ratio  of  the  cells  delivered  to  a  wrong  destination  to  the 
total  number  of  cells  sent.  It  occurs  as  a  result  of  an  undetected  error  in  the  header  that  causes  a  change 
of  the  cell  destination. 

4.2.1  Single  Channel 

Table  A1  in  the  appendix  shows  a  sample  of  raw  data  collected  for  single  channel  ATM  transmission.  While 
many  signal  channel  runs  were  made,  a  selected  number  of  them  are  plotted  in  Figure  8  to  13  to  illustrate  the  re¬ 
sults  obtained.  The  particular  experiment  and  the  date  performed  are  indicated  on  the  plots.  From  these  Figures 
the  following  observations  may  be  made: 

•  CLR  and  CER  appear  to  have  similar  relations  with  channel  BER  They  both  decrease  as  BER  de¬ 
creases. 

•  CLC  and  ECC  are  lowest  at  BER  =10'5  for  the  range  of  BER  =10"°  to  10 4. 

•  For  the  received  ATM  alarms,  it  is  observed  that  a)  Framing  rate  appears  to  take  its  highest  value  at 
BER  =  10’5,  and  takes  its  minimum  value  in  the  neighborhood  of  10-6,  and  b)  FEBE  and  P-Bit  rate  ap¬ 
pear  to  increase  steadily  with  increasing  BER. 

•  For  the  PLCP  alarms,  the  FEBE  rate  and  P-Bit  rate  appear  to  have  similar  relations  with  channel  BER. 

•  The  forward  and  backward  satellite  channels  between  RL  and  CRC  appear  to  be  symmetric  with  identi¬ 
cal  measured  ATM  parameters. 

4.2.2  Multiple  Channel 

A  sample  of  the  raw  data  collected  for  the  multiple  channel  ATM  transmission  is  shown  as  Table  A2  in  the  ap¬ 
pendix.  As  many  as  six  ATM  substreams  of  varying  cell  rates  were  combined  in  the  multiple  channel  case  with 
cell  generating  rate  of  8788  Cells/s.  As  shown  in  Table  A2,  two  types  of  measured  parameters  may  be  distin¬ 
guished.  One  set  has  to  do  with  he  alarms  that  are  generated  for  the  composite  stream,  mainly,  received  frame 
alarm  and  received  PLCP  alarm.  The  other  set  of  are  the  ATM  Cell  characteristics  of  each  of  the  individual  sub- 
streams  as  shown  in  Table  2.  Figures  14  to  16  show  the  multichannel  ATM  parameters  as  function  of  channel 
BER.  The  following  observations  may  be  made  regarding  the  Figures. 

•  Substreams  with  similar  configurations  exhibit  similar  characteristic  with  respect  to  the  ATM  QoS  pa- 


rameters. 

•  The  received  ATM  alarms,  FEBE,  P-Bit.  and  Framing  rates  exhibit  similar  functional  relationship  with 
the  channel  BER. 

•  The  effect  of  RS  coding  on  the  ATM  transmission  was  not  evident.  However,  it  is  conjectured  that  the 
use  of  RS  coding  will  improve  performance  considerably. 

6.0  CONCLUSION 

In  this  experiment,  we  have  successfully  transmitted  and  received  ATM  Cells  between  Rome  Laboratory,  GAFB, 
New  York  and  CRC,  Canada.  These  ATM  tests  were  performed  at  Ka-band  frequencies  over  the  ACTS.  Two 
transponders  (or  beams)  -  the  East  Scan  4E  spot  beams  and  the  Steerable  spot  beam,  were  used  and  the  signal 
modulation  schemes  employed  were  the  QPSK  and  8PSK.  It  is  observed  in  our  experiment  that  the  modulation 
scheme  used  affects  the  stability  of  the  received  signal.  Also,  during  these  experiments,  we  observed  that  rain  at¬ 
tenuation  has  a  dramatic  effect  in  satellite  transmissions  at  Ka-band  frequencies.  Ways  of  mitigating  this  effect 
should  be  the  focus  of  more  studies. 

Although  we  were  able  to  transmit  and  receive  ATM  Cells  at  Ka-band,  it  will  be  premature  to  draw  a  conclusion 
on  the  performance  of  ATM  transmission  over  satellite  as  a  result  of  one  set  of  experiments.  It  is  strongly  recom¬ 
mended  that  more  experiments  be  conducted  after  which  a  more  meaningful  conclusion  on  the  performance  of 
ATM  signals  over  satellite  can  be  reached.  Some,  if  not  all,  of  the  test  performed  in  this  experiment  should  be  re¬ 
peated  in  other  to  verify  the  validity  of  these  test  results. 
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Figure  3.  8PSK  Channel  Characterization  without 
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Figure  4.  QPSK  and  8PSK  Channel  Characterization 
without  Reed  Solomon  Encoding 


Figure  5.  QPSK  and  8PSK  Channel  Characterization 
without  Reed  Solomon  Encoding 


Appendix 

A:  Weather  Conditions  and  Schedule  of  Experiments 


# 

Date 

Time 

Weather 

Comments 

1. 

27  June  95  (Tuesday) 

1815-1930  EDT 

Sunny 

Terminal  Configuration/Setup 

2. 

29  June  95  (Thursday) 

1230-1630  EDT 

Sunny 

Modem  Configuration 

3. 

5  July  95  (Wednesday) 

1200-1600  EDT 

Hazy 

Modem  Configuration 

6  July  95  (Thursday) 

0800-1100  EDT 

Hazy  &  Hot 

C/No  Measurement 

7  July  95  (Friday) 

1300-1545  EDT 

Hazy  &  Hot 

BER,  Eb/No  measurement  (QPSK) 

6. 

10  July  95  (Monday) 

1045-1430  EDT 

Cloudy 

CW  measurement 

7. 

14  July  95  (Friday) 

1030-1400  EDT 

Clear 

8-PSK/ADTECH 

8. 

17  July  95  (Monday) 

1045-1430  EDT 

Sunny 

ATM  (QPSK,  No  RS) 

9. 

21  July  95  (Friday) 

0730-1230  EDT 

Cloudy 

ADTECH  problem 

10. 

24  July  95  (Monday) 

1045-1430  EDT 

Cloudy 

ADTECH  problem  (QPSk,  No  RS) 

11. 

27  July  95  (Thursday) 

1430-1700  EDT 

Cloudy 

ADTECH  problem  (QPSk,  No  RS) 

12. 

3  Aug.  95  (Thursday) 

1100-1500  EDT 

Overcast 

Modem  problem  -  no  data 

13. 

4  Aug.  95  (Friday) 

1100-1500  EDT 

Cloudy 

Modem  problem  -  no  data 

14. 

7  Aug.  95  (Monday) 

11 15-1530  EDT 

Sunny 

RL  loop-back  test-Modem  problem 

15. 

11  Aug.  95  (Friday) 

0900-1300  EDT 

Cloudy 

Modem  Configuration 

16. 

14  Aug.  95  (Monday) 

0800-1200  EDT 

Sunny 

ATM  (8-PSK,  no  RS.  6  substreams) 

17. 

16  Aug.  95(Wednesday) 

0800-1100  EDT 

Sunny 

ATM  (8-PSK.  no  RS,  6  substreams) 

18. 

20  Aug.  95  (Sunday) 

1500-1930  EDT 

Sunny 

RL  loop-back  (ATM,  QPSK) 

19. 

22  Aug.  95  (Tuesday) 

0915-1300  EDT 

Sunny 

ATM  (QPSK,  No  RS,  6  Streams) 

20. 

25  Aug.  95  (Friday) 

0800-1100  EDT 

Sunny 

ATM  (QPSK,  No  RS,  6  Streams) 
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The  research  conducted  during  the  1 995  summer  faculty  research  program  at  Hanscom 
Air  Force  Base,  MA,  in  the  laboratory  of  Dr.  Charles  Woods  (RL/EROP)  consisted  of  several 
topics.  Each  topic  will  be  discussed  separately.  It  should  be  noted  that  I  do  not  type,  therefore 
written  manuscripts  are  being  typed  at  Tufts  University  for  publication.  Additionally  the 
manuscripts  are  very  mathematical  but  an  attempt  will  be  made  to  describe  the  contents  and  their 
possible  applications.  The  topics  are: 

Ultimate  Limits  on  Spatial  Light  Modulator  Performance  for 
Adaptive  Optics  in  Turbulent  Atmospheres 
Adaptive  optics  to  correct  atmospherically  induced  wavefront  aberrations  that  degrade 
imaging  system  performance  is  still  a  topic  of  current  interest  to  the  Air  Force.  The  basic  ideas 
(call  them  collectively,  the  strategy)  behind  adaptive  optics  is  now  over  25  years  old,  and  it  can 
be  said  that  there  is  very  little  disagreement  among  investigators  as  to  what  must  be  done. 

Rather  the  tactics  (i.e.,  the  actual  carrying  out  of  the  necessary  mathematics,  physics  and 
engineering)  are  still  undergoing  change  mostly  because  sensor  technology  has  undergone  rapid 
advances  and  computers  have  also  been  speeded  up  over  the  years.  These  two  factors  combine  to 
open  up  new  tactics  for  adaptive  optics  that  were  virtually  impossible  even  a  few  years  ago. 

To  this  end.  Dr.  Charles  Woods  (my  focal  point  at  Rome  Labs,  Hanscom  AFB)  has 
requested  that  I  initiate  (in  conjunction  with  him)  a  research  effort  to  determine  the  performance 
of  spatial  light  modulators  (SLMs)  for  possible  use  as  wavefront  correctors.  It  should  be  noted 
that  I  have  had  considerable  experience  in  various  aspects  of  adaptive  optics  having  been  funded 
several  times  by  Rome  Air  Development  Center  in  the  late  1970's  and  through  the  1980's  dealing 
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mostly  with  the  development  of  numerical  techniques  for  adaptive  optics,  see  for  example 
R.  Barakat  and  G.  Newsam,  "Algorithms  for  reconstruction  of  partially  known, 
bandlimited  Fourier  transform  pairs  from  noisy  data",  J.  Opt.  Soc.  ,4m.  A2, 2027-2039 
(1985).  Contains  references  to  much  of  the  previous  work  of  B.  and  N.  appearing  in 
applied  mathematics  journals. 

as  well  as  problems  with  adaptive  optics  per  se,  see  for  example 

P.  Nisenson  and  R.  Barakat,  "Partial  atmospheric  correction  with  adaptive  optics",  J.  Opt. 
Soc.  Am.  A4,  2249-2253  (1987). 
and 

R.  Barakat  and  B.  Sandler,  "Determination  of  the  wavefront  aberration  function  from 
measured  values  of  the  point  spread  function:  a  nonlinear  two  dimensional  phase  retrieval 
problem",  J.  Opt.  Soc.  Am.  9A,  1715-1723  (1990). 

Previous  technology  employed  various  acousto-optical  devices,  and  although  results  were 
excellent  it  is  of  some  importance  to  employ  new  (and  hopefully)  better  methods.  Although 
there  are  a  vast  number  of  papers  on  the  theory  and  application  of  adaptive  optics,  tor  our 
purposes  two  relatively  old  reviews  are  still  of  paramount  importance  for  the  SLM  problem: 

J.  Hardy,  "Adaptive  optics:  a  new  technology  for  the  control  of  light",  Proc.  IEEE,  66, 
651-697(1978). 

J.  Pearson,  R.  Freeman,  and  H.  Reynolds,  "Adaptive  optical  techniques  for  wavefront 
correction",  in:  R.  Shannon  and  J.  Wyant  (eds.),  Applied  Optics  and  Optical  Engineering, 
Vol.  7  (Academic  Press,  New  York,  1979)  pp.  246-340. 

The  second  is  especially  useful  because  it  deals  exhaustively  with  the  acousto-optical  device 
issue. 
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The  SLM  problem  is  formulated  in  the  following  fashion.  Let  W(p,q)  be  the  spatially 
random  wavefront  (induced  by  the  turbulent  atmosphere)  in  the  exit  pupil  of  the  optical  system. 
A  thin  plate  mirror  made  up  os  contiguous  SLMs  is  governed  by  a  "flexibility"  matrix,  F,  which 
defines  the  mirror  deflection,  D,  at  points  over  the  mirror  due  to  the  array  of  SLMs,  S.  Thus 

D  =  FS 

in  matrix  form.  Our  problem  is  to  solve  for  S  in  terms  of  a  given  (measured  D)  and  a  given 
(measured)  F,  so  that  in  symbolic  form 

S  =  F-'D 

The  actual  inversion  of  S  is  quite  complicated  and  involves  singular  value  decomposition  and  the 
corresponding  pseudo-inverse.  The  actual  details  will  be  written  up  in  the  near  future. 

Now  D  and  W  are  directly  related 

D  =  cW 

where  c  is  a  numerical  constant.  Consequently 

S  =  cF'W 

This  equation  should  be  understood  in  the  sense  that  since  W  is  a  sample  realization  of  the 
random  wavefront,  then  so  is  the  solution  S  a  sample  realization. 

The  question  now  arises  as  to  the  numerical  generation  of  the  two-dimensional,  zero- 
mean,  Gaussian  random  process  with  prescribed  covariance  function  (a  very  difficult  problem). 
Fortunately  an  algorithm  to  perform  this  complicated  task  has  already  been  worked  out,  see: 

R.  Barakat  and  J.  Beletic,  "Influence  of  atmospherically  induced  random  wavefronts  on 
diffraction  imagery:  a  computer  simulation  model  for  testing  image  reconstruction 
algorithms",  J.  Opt.  Soc.  Am.  7  A,  653-671  (1990). 

1  have  investigated  the  use  of  the  IBM  parallel  supercomputer  on  Maui  (AMOS  project)  to  carry 


out  the  needed  manipulations.  The  previous  calculations  were  carried  out  on  the  vonNuemann 
supercomputer  at  Princeton  University  (now  permanently  closed).  However  I  have  spent  time 
during  the  summer  in  ways  of  speeding  up  the  algorithm  by  a  factor  of  our  over  the  original 
approach. 

The  only  remaining  problem  we  have  is  to  specify  the  SLM's;  in  continuing  discussions 
with  Dr.  Woods  we  have  settled  on  a  few  SLMs  currently  available  (or  soon  to  be  available)  as 
candidates.  Future  work  will  depend  upon  a  Summer  Research  Extension  Program  grant  for 
1996. 


Log  Intensity  Statistics  of  Optical  Wave  Propagation  in  a  Turbulent  Atmosphere 

Several  years  ago,  I  developed  a  theory  of  weak  scattering  of  laser  beams  in  atmospheric 
turbulence  with  direct  application  to  LIDAR.  optical  imaging  issues,  and  optical  communication: 
R.  Barakat,  "Weak  scatter  generalization  of  the  K-density  function  with  application  to 
laser  scattering  in  atmospheric  turbulence",  J.  Opt.  Soc.  Am.,  3 A,  401-409  (1986). 

R.  Barakat,  "Weak  scatter  generalization  of  the  K-density  function,  II  probability  density 
of  total  phase",  J.  Opt.  Soc.  Am.  4A,  1213-1219  (1987). 

We  now  wish  to  extend  the  theory  in  terms  of  the  logarithm  of  the  intensity  (log-intensity) 
because  many  of  the  field  measurements  are  carried  out  in  terms  of  the  log-intensity  and  it  would 
be  desirable  to  have  expressions  for  the  probability  density  and  moments,  particularly  the 
variance,  in  terms  of  the  log-intensity.  Under  last  summer's  research  program,  I  began  the 
analysis  and  was  able  to  essentially  complete  it  during  this  summer.  Only  some  additional 
numerical  calculations  are  needed  to  complete  the  manuscript  (and  they  are  being  completed  at 
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Tufts  University).  The  work  has  been  summarized  in  a  manuscript  entitled: 

"Log-intensity  statistics  for  scintillation  described  by  generalized  K-distributions" 

The  manuscript  is  being  typed  at  Tufts  University  for  publication  in  an  archival  journal,  most 
likely  the  Journal  of  the  Optical  Society  of  America. 

The  results  of  this  work  permit  a  considerable  generalization  of  the  theory  to  cover 
situations  that  are  inaccessible  to  the  strong  scatter  theories  because  the  weak  scatter  theory 
contains  two  independent  parameters,  the  strong  scatter  theories  only  one  independent  parameter. 
Although  we  cannot  go  into  detail  here,  a  figure  is  appended  which  shows  the  probability  density 
of  the  log-intensity  for  a  typical  weak  scatter  situation. 


L-OG-  XWTE-KrS  ir'f 
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Hankel  Transform  Quadrature  Algorithm 

This  development  of  numerical  algorithms  for  fast,  but  accurate  evaluation  of  Hankel 
transforms  occupied  some  of  my  attention  last  summer  and  I  felt  that  the  work  should  be 
completed  at  the  end  of  this  summer's  research  funding.  Hankel  transforms,  i.e., 


=  ^(P)Jn 


(rp)p  dp 


Where  Jn  is  the  Bessel  function  of  integral  order  (n  =  0,1,2,...)  and  A(p)  is  given,  function  appears 
in  a  wide  variety  of  problems  in  optics  such  as  signal  processing,  laser  resonators,  waveguide 
propagation,  etc.  in  most  applications  the  integral  must  be  evaluated  a  large  number  of  times  and 
with  reasonable  accuracy  (say  6  to  8  digits).  Thus  any  numerical  quadrature  algorithm  must  be 
efficient  and  accurate,  if  it  is  to  be  effective. 

The  present  algorithm  employs  a  change  of  variables  of  the  independent  variable  p  and 
the  dependent  variable  r  using  logarithms  to  rewrite  the  Hankel  transform  as  a  convolution 
integral.  I  have  worked  the  remaining  mathematical  details  needed  to  maintain  accuracy  in  the 
evaluation  of  a  key  component  of  the  algorithm,  the  gamma  function  of  complex  argument.  The 
function  A(p)  with  its  logarithmic  change  of  p  is  expanded  in  the  Whittaker  cardinal  function  (a 
type  of  sampling  expansion).  It  can  be  shown  that  the  resultant  expression  for  h(r)  has 
coefficients  that  can  be  evaluated  in  terms  of  the  Fourier  transform  of  the  gamma  function  of 
complex  argument  by  sing  the  Filon  quadrature  scheme  as  developed  in 

R.  Barakat,  "Calculation  of  integrals  encountered  in  optical  diffraction  theory",  in  R. 

Fruden  (ed.),  The  Computer  in  Optical  Research  (Springer-Verlag,  New  York,  1980). 

The  algorithm  has  two  virtues:  it  is  fairly  fast  and  quite  accurate  (I  estimate  that  8  digits  can  be 
routinely  obtained);  it  is  applicable  to  any  real  positive  integer  n.  The  actual  programming  of  the 
algorithm  will  be  carried  out  during  the  late  fall;  unfortunately  it  is  very  complicated  as  far  as 
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programming  is  concerned. 


Two  possible  uses  are: 

1 .  semiconductor  strip  lasers  have,  to  say  the  least,  very  unusual  mode  structures  that 
must  be  analyzed  accurately  if  their  capabilities  are  to  be  efficiently  utilized.  At  present,  there 
are  various  ad  hoc  and  hybrid  approximate  approaches  that  have  because  of  the  difficulty  in 
evaluating  the  radial  part  of  the  mode  structure.  I  expect  that  it  will  be  possible  to  entertain  a 
more  exact  approach  since  we  can  now  numerically  evaluate  the  integrals. 

2.  optical  signal  processing  operations  often  require  the  rapid,  accurate  numerical 
evaluation  of  Hankel  transforms.  Some  of  these  issues  (but  at  an  elementary  level)  are  discussed 
in 

V.  Das,  Optical  Signal  Processing  (Springer- Verlag,  New  York,  1991). 

In  summary,  the  algorithm  should  be  of  great  help  in  numerical  evaluation  of  Hankel  transforms 
that  occur  in  various  optical  contexts. 


Numerical  Modelling  Problems  in  Photoreffactive  Media 

A  method  of  achieving  lock-in  detection  using  the  photorefractive  effect  has  been 
developed  by  scientists  at  the  Rome  Laboratories,  Hanscom  AFB;  their  work  on  phase  sensitive 
detection  appears  to  offer  advantages  over  previous  attempts: 

J.  Khoury,  C.  Woods  and  M.  Cronin-Golomb,  "Photorefractive  time  correlation  motion 
detection",  Opt.  Comm.,  85  5-9  (1991). 


J.  Khoury,  V.  Ryan,  M.  Cronin-Golomb  and  C.  Woods,  "Photorefractive  frequency 

converter  and  phase  sensitive  detection",  J.Opt.  Soc.  Am.,  10B,  82-82  (1993). 

They  describe  a  technique  for  extracting  a  small  signal  phase  modulation  embedded  in  a  large 
noise  environment. 

The  final  form  of  their  analysis  contains  a  double  integral  whose  integrand  is  a  rapidly 
oscillating  function,  the  rapidity  of  the  oscillations  increasing  with  time.  They  were  able  to 
evaluate  the  double  integral  in  special  cases  using  various  analytic  techniques,  but  were  not  able 
to  study  more  general  situations  of  practical  interest.  During  the  summer,  I  investigated  ways  of 
evaluating  the  double  integral  numerically.  The  problem  is  very  difficult  because  the  integrand 
is  so  highly  oscillatory  in  that  conventional  integration  schemes  such  as  Simpson  or  Gauss 
quadrature  are  ineffective. 

I  have  devised  an  integration  scheme  which  separates  the  integrand  thusly 
(integrand)  =  (slowly  varying  component)  x  (rapidly  varying  exponential) 

The  slowly  varying  component  is  approximated  by  a  quadratic  function  over  one  of  the  2-D  cells, 
while  the  rapidly  varying  exponential  is  left  alone.  By  various  elementary  (but  extremely 
tedious)  manipulations  one  can  obtain  a  quadrature  formula  for  the  double  integral  for  which  the 
error  induced  is  essentially  independent  of  the  time.  Although  the  mathematical  manipulations 
are  (as  noted)  cumbersome,  the  final  expression  is  quite  easy  to  program! 

Dr.  Charles  Woods  plans  to  program  the  double  integral  algorithm  sometime  this  fall  and 
we  plan  to  begin  examining  numerically  further  problems  of  signal  processing  that  are  too 
complicated  to  be  dealt  with  by  analytic  methods. 
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A  THEORY  OF  THE  STRUCTURE  OF  DOMAIN  THEORIES 

D.  Paul  Benjamin 
Assistant  Professor 
Computer  Science  Department 
Oklahoma  State  University 

Abstract 

Domain  theories  are  used  in  a  wide  variety  of  fields  of  computer  science  as  a  means  of  representing 
properties  of  the  domain  under  consideration.  These  fields  include  artificial  intelligence,  software  engi¬ 
neering,  VLSI  design,  cryptography,  and  distributed  computing.  In  each  case,  the  advantages  of  using 
theories  include  the  precision  of  task  specification  and  the  ability  to  verify  results.  As  the  complexity  of 
systems  in  these  fields  increases,  these  advantages  become  crucial.  A  great  deal  of  effort  has  gone  into 
the  development  of  tools  to  make  the  use  of  theories  easier.  This  effort  has  met  with  some  success.  How¬ 
ever,  a  fundamental  problem  remains:  the  choice  of  formulation  for  a  theory.  This  paper  describes  funda¬ 
mental  research  on  the  representation  of  domain  theories.  The  perspective  of  this  work  is  to  view  a 
problem’s  state  space  as  though  it  were  physical  space,  and  the  actions  in  the  state  space  as  though  they 
were  physical  motions.  A  domain  theory  should  then  state  the  laws  of  motion  within  the  space.  Follow¬ 
ing  the  analogy  with  physics,  a  representation  is  a  coordinate  system,  and  theories  are  transformed  by 
changing  coordinates.  The  mathematical  basis  for  this  analogy  is  given,  and  illustrated  on  two  simple 
examples. 
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A  THEORY  OF  THE  STRUCTURE  OF  DOMAIN  THEORIES 


D.  Paul  Benjamin 


Introduction 

Domain  theories  are  used  in  a  wide  variety  of  fields  of  computer  science  as  a  means  of  representing 
properties  of  the  domain  under  consideration.  These  fields  include  artificial  intelligence,  software  engi¬ 
neering,  VLSI  design,  cryptography,  and  distributed  computing.  In  each  case,  the  advantages  of  using 
theories  include  the  precision  of  task  specification  and  the  ability  to  verify  results.  As  the  complexity  of 
systems  in  these  fields  increases,  these  advantages  become  crucial.  A  great  deal  of  effort  has  gone  into 
the  development  of  tools  to  make  the  use  of  theories  easier.  This  effort  has  met  with  some  success.  How¬ 
ever,  a  fundamental  problem  remains:  the  choice  of  formulation  for  a  theory.  This  paper  describes  funda¬ 
mental  research  on  the  representation  of  domain  theories.  The  perspective  of  this  work  is  to  view  a 
problem’s  state  space  as  though  it  were  physical  space,  and  the  actions  in  the  state  space  as  though  they 
were  physical  motions.  A  domain  theory  should  then  state  the  laws  of  motion  within  the  space.  Follow¬ 
ing  the  analogy  with  physics,  a  representation  is  a  coordinate  system,  and  theories  are  transformed  by 
changing  coordinates.  The  mathematical  basis  for  this  analogy  is  given,  and  illustrated  on  two  simple 
examples. 

Theories  must  be  organized  so  that  appropriate  theories  can  be  efficiently  retrieved  in  new  situations. 
If  there  are  too  many  specialized  theories,  appropriate  theories  will  not  be  found  quickly  enough,  so  gen¬ 
eral  theories  are  needed.  But  overgeneralization  of  theories  causes  the  retrieval  of  theories  that  are  inap¬ 
propriate.  So  it  is  necessary  to  precisely  identify  the  class  of  problems  on  which  a  given  theory  can  be 
reused.  Removing  implementation  detail  from  the  representation  is  necessary  to  produce  as  general  a 
theory  as  possible,  which  can  be  reused  in  many  different  implementations.  Furthermore,  even  at  a  given 
level  of  generality,  a  theory  can  be  represented  in  a  large  number  of  different  ways  that  vary  in  their 
computational  effectiveness.  A  good  choice  of  notation  and  a  good  choice  of  formulation  within  that 
notation  are  absolutely  necessary  for  effective  use  of  theories.  A  simple  example  is  given  by  the  follow- 
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ing  two  representations  for  the  two-disk  Towers  of  Hanoi. 


x  =  Move  disk  d  one  peg  right 
(wrapping  around) 

x"1  =  Move  disk  d  one  peg  left 
(wrapping  around) 


a  =  Move  the  top  disk  on  peg  p  one 
peg  right  (wrapping  around) 

a'1  =  Move  the  top  disk  one  peg 
left  to  peg  p  (wrapping  around) 


Even  in  this  simple  puzzle  there  are  many  possible  formulations.  The  two  formulations  given  above 
differ  in  that  the  first  indexes  the  moves  according  to  the  disk  moved,  whereas  the  second  indexes  the 
moves  by  the  peg  moved  from  (or  to,  for  the  inverse  moves.)  The  advantage  of  the  first  formulation  is 
clear:  it  scales  up  to  theories  for  more  disks,  because  the  actions  x$  and  xl  will  be  included  unchanged  in 
those  theories.  New  moves  will  be  added  for  the  new  disks.  However,  the  actions  in  the  second  formula¬ 
tion  must  be  redefined  as  more  disks  are  added,  so  this  theory  does  not  scale  up.  The  result  is  that  analy¬ 
sis  and  synthesis  of  the  two-disk  problem  performed  using  the  first  formulation  can  be  reused  when 
larger  problems  are  attempted,  but  analysis  and  synthesis  from  the  second  formulation  must  be  discarded 
when  larger  problems  are  attempted. 

Computer  science  is  not  the  first  field  to  be  faced  with  the  problem  of  properly  formulating  theories. 
Throughout  the  history  of  science,  it  has  always  been  desirable  to  formulate  theories  in  as  general  a  way 
as  possible,  so  that  important  regularities  are  identified  and  separated  from  details  particular  to  individ¬ 
ual  situations.  In  particular,  physics  has  had  a  great  deal  of  success  in  formulating  theories  of  wide  gen¬ 
erality  yet  high  predictive  accuracy.  In  this  paper,  we  will  see  that  many  of  the  mathematical  structures 
employed  in  the  statement  of  physical  theories  can  be  usefully  generalized  to  the  statement  of  abstract 


theories. 


We  will  begin  with  a  brief  discussion  of  the  properties  of  physical  theories  in  the  next  two  sections. 
The  remainder  of  the  paper  discusses  the  appropriate  mathematics  for  analyzing  these  properties,  and 
gives  three  simple  examples  of  the  analysis  and  reformulation  of  theories. 

Invariants  of  Laws 

The  ability  to  formulate  any  law  of  nature  depends  on  the  fact  that  the  predictions  given  by  the  law, 
together  with  certain  initial  conditions,  will  be  the  same  no  matter  when  or  where  the  results  of  the  pre¬ 
dictions  are  observed.  In  physical  theories,  the  fact  that  absolute  time  and  location  are  never  relevant  is 
essential  for  the  statement  of  laws;  without  this  fact,  general  laws  could  not  be  stated,  and  the  complex¬ 
ity  of  the  world  would  eliminate  the  possibility  of  intelligent  comprehension  of  the  environment.  This 
irrelevance  is  stated  in  terms  of  the  invariance  of  laws  under  translation  in  time  and  space.  Such  invari¬ 
ance  is  so  self-evident  that  it  was  not  even  stated  clearly  until  less  than  a  century  ago.  It  was  Einstein 
who  recognized  the  importance  of  invariance  in  the  formulation  of  physical  law,  and  brought  it  to  the 
forefront  of  physics.  Before  Einstein,  it  was  natural  to  first  formulate  physical  law  and  then  derive  the 
laws  of  invariance.  Now,  the  reverse  is  true.  As  the  eminent  physicist  Wigner  states,  “It  is  now  natural 
for  us  to  try  to  derive  the  laws  of  nature  and  to  test  their  validity  by  means  of  the  laws  of  invariance, 
...“(Wigner,  1967,  p.5).  This  is  especially  clear  in  the  development  of  quantum  mechanics. 

Invariance  is  important  not  only  in  physics.  As  Dijkstra  states,  “Since  the  earliest  days  of  proving  the 
correctness  of  programs,  predicates  on  the  programs’s  state  space  have  played  a  central  role.  This  role 
became  essential  when  non-deterministic  systems  were  considered....  I  know  of  only  one  satisfactory 
way  of  reasoning  about  such  systems:  to  prove  that  none  of  the  atomic  actions  falsifies  a  special  predi¬ 
cate,  the  so-called’global  invariant’.”  (Dijkstra,  1985.)  In  other  words,  as  the  system  moves  in  its  state 
space,  the  global  invariant  is  a  law  of  motion.  Dijkstra  goes  on  to  point  out  the  central  difficulty  in  the 
use  of  invariants:  “That  solves  the  problem  in  principle;  in  each  particular  case,  however,  we  have  to 
choose  how  to  write  down  the  global  invariant.  The  choice  of  notation  influences  the  ease  with  which  we 
can  show  that,  indeed,  none  of  the  atomic  actions  falsifies  the  global  invariant.”  We  need  a  mathematics 
of  invariance  to  help  us  formulate  invariants. 

Symmetry 

A  symmetry  is  a  mapping  of  an  object  or  system  to  itself  such  that  the  result  of  the  mapping  is  indis¬ 
tinguishable  from  the  original.  For  example,  the  human  body  (idealized)  has  a  left-right  symmetry.  The 
following  square  has  a  number  of  symmetries,  including  flipping  it  onto  itself  about  the  lines  x  =  1/2  or 
y  =  1/2  or  x  =  y,  and  rotating  it  ninety  degrees  either  clockwise  or  counterclockwise: 
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Symmetries  can  exist  in  physical  space  or  in  state  space.  For  each  invariance,  there  is  a  correspond¬ 
ing  set  of  symmetries,  each  of  which  maintains  the  invariant.  For  example,  the  invariance  of  physical 
law  under  translation  in  space  corresponds  to  the  symmetries  of  space  under  all  translations.  Also,  the 
global  invariant  of  a  non-deterministic  program  corresponds  to  all  permutations  of  the  atomic  actions; 
each  permutation  maintains  the  invariant.  This  correspondence  holds  in  reverse,  also.  For  each  set  of 
symmetries,  there  is  a  corresponding  invariant. 

For  example,  the  square  above  can  be  represented  by  the  following  theory: 

x  =  0,  0  <  y  <  1 

x  =  1,  0  <y  <  1 
y=0,  0  <  x  <  1 
y  =  1,  0  <  x  <  1 

This  theory  has  syntactic  symmetries  corresponding  to  the  symmetries  of  the  square,  e.g.  interchang¬ 
ing  x  and  y  gives  the  flip  about  the  line  x  =  y,  and  interchanging  the  first  and  second  lines  of  the  theory 
flips  the  square  about  the  line  x  =  1/2. 

Many  of  the  important  symmetries  in  physics  are  geometric.  In  other  words,  they  are  symmetries  of 
the  space  in  which  motion  takes  place.  By  viewing  a  program  as  “moving”  in  its  state  space,  we  can  take 
the  same  approach  as  physicists:  formulate  geometric  symmetries  of  the  space,  and  use  them  to  derive 
invariants,  thereby  obtaining  laws  governing  the  use  of  the  program. 

The  Mathematics  of  Symmetry:  Groups  Theory 

Given  a  global  invariant,  the  corresponding  set  of  transformations  is  closed  under  composition,  and 
for  every  transformation,  its  inverse  is  also  a  transformation  that  preserves  the  invariant.  The  identity 
transformation  is  also  always  in  the  set.  Thus,  the  set  of  transformations  form  a  group.  Group  theory  is 
the  language  of  symmetries,  and  has  assumed  a  central  role  in  modern  physics.  Group  theory  can  also  be 
used  to  analyze  the  symmetries  of  a  task  and  derive  invariants,  which  are  then  used  to  synthesize  a  pro¬ 
gram.  The  following  example  is  given  in  Benjamin  (1994).  (This  paper  will  not  provide  any  background 
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in  group  theory.  The  reader  is  referred  to  any  standard  text.) 


Let  us  begin  by  examining  a  simple  task,  the  2x2x2  Rubik's  Cube  with  180-degree  twists  (we  use 
such  a  small  example  for  brevity  of  presentation,  but  the  techniques  are  generally  applicable,  as  will  be 
shown).  Let  the  8  cubicles  (the  fixed  positions)  in  the  2x2x2  Cube  be  numbered  in  the  following  way  (8 
is  the  number  of  the  hidden  cubicle): 


The  goal  configuration  for 
the  2x2x2  Rubik’s  Cube  with 
1 80-degree  twists. 


Number  the  cubies  (the  movable,  colored  cubes)  similarly,  and  let  the  goal  be  to  get  each  cubie  in  the 
cubicle  with  the  same  number.  For  brevity  of  presentation,  we  will  consider  only  180°  twists  of  the  cube. 
Let  f,  r,  and  t  denote  180°  clockwise  turns  of  the  front,  right,  and  top,  respectively  (cubie  8  is  held  fixed; 
Dorst  (1989)  shows  that  this  is  equivalent  to  factoring  by  the  Euclidean  group  in  three  dimensions).  Note 
that  this  cubie  numbering  is  just  a  shorthand  for  labeling  each  cubie  by  its  unique  coloring.  This  holds 
true  for  the  Cube  with  only  180°  twists,  as  position  determines  orientation. 

The  actions  for  the  Cube  can  be  represented  as  a  group,  which  is  generated  by  the  actions  f,  r,  and  t. 
We  use  group  representation  theory  to  represent  f,  r,  and  t  as  matrices: 


0  0  1  0  0  0  0 

1  0  0  0  0  0  0 

0  1  0  0  0  0  0 

0  1  0  0  0  0  0 

0010000 

1  0  0  0  0  0  0 

1  0  0  0  0  0  0 

0  1  0  0  0  0  0 

0  0  1  0  0  0  0 

0  0  0  0  0  10 

r  = 

0  0  0  0  0  0  1 

t  = 

0  0  0  0  1  0  0 

0000100 

0  0  0  0  1  0  0 

0  0  0  1  0  0  0 

0001000 

0  0  0  0  0  1  0 

0  0  0  0  0  1  0 

0  0  0  0  0  0  1 

0001000 

0  0  0  0  0  0  1 

These  matrices  are  7-dimensional,  corresponding  to  the  7  unsolved  cubies.  We  find  eigenvectors  of 
eigenvalue  1;  these  are  the  invariants.  Any  invariant  of  all  the  actions  is  irrelevant  and  can  be  projected 
away.  To  do  this,  we  first  change  the  coordinate  system  so  that  the  invariant  eigenvectors  are  axes,  and 
then  project  to  the  noninvariant  subspace,  removing  all  irrelevant  information  at  once.  In  this  case,  the 
eigenvectors  are: 


0 

1 

» 

1 

0 

,  for  X  =  1,  and 

0 

-1 

for  A,  =  - 1  f: 

1 

0 

f 

0 

1 

,  for  X  =  1,  and 

-1 

0 

1 

0 

_1_ 

1 

0 

1_ 
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1 

0 

-1 

1 

1 

. 

0 

,  for  X  =  1 ,  and 

1 

for  A,  =  -1  and  the  common  invariant  eigenvector  is: 

1 

0 

_1_ 

0 

1 

Note  that  we  have  shortened  these  eigenvectors  to  save  space;  they  are  actually  7-vectors,  with  addi¬ 


tional  zeroes.  We  then  change  the  basis.  The  appropriate  matrix  is: 


P  = 


73  72  76 

73  °  72  76 

1  -2 

1  till  Q  II  ItQ  " 

73  76 

0^00 

2 

0  i  0  0 

2 

0^00 

2 

0^00 

2 


""0 

„„0 

0 

1 

2 

-1 

2 

-1 

2 

1 

2 


0  ""0 


,.„0  ..„0 


0  0 

2  1 

2  2 

-1  1 

2  2 

1  -2 

2  2 

-1  -1 

2  2 


yielding  the  new  representations  for  r,  f,  and  t: 


1  0 

0 

0 

0 

0 

0 

1  0 

0 

0 

0  0  0 

0  1 

0 

0 

0 

0 

0 

0  1 

0 

0 

0  0  0 

0  0 

1 

2 

73 

2 

0 

0 

0 

0  0 

1 

2 

73 

2 

0  0  0 

0  0 

73 

1 

0 

0 

0 

f  = 

0  0 

73 

1 

0  0  0 

2 

2 

2 

2 

0  0 

0 

0 

1 

0 

0 

0  0 

0 

0 

0  0-1 

0  0 

0 

0 

0 

0 

-1 

0  0 

0 

0 

0  1  0 

0  0 

0 

0 

0 

-1 

0 

0  0 

0 

0 

-10  0 

1  0 

0 

0 

0 

0 

0 

0  1 

0 

0 

0 

0 

0 

0  0 

-1 

0 

0 

0 

0 

0  0 

0 

1 

0 

0 

0 

0  0 

0 

0 

0 

-1 

0 

0  0 

0 

0 

-1 

0 

0 

0  0 

0 

0 

0 

0 

1 

This  procedure  computes  the  irreducible  invariants  of  a  group.  The  irreducible  factors  of  dimension 
1,  1,  2,  and  3  are  found  along  the  diagonals  of  the  matrices.  Projecting  to  these  subspaces  yields  two 


subproblems: 


1  73 

1  73 

r  = 

2  2 

73  1 
_  2  2_ 

f  = 

2  2 

73  1 

2  2_ 

t  = 

II 

-1  0 
0  1_ 

On  cubelets  1,  2,  and  3,  the 
subgroup  generated  is 
{i.  r.  f.  t.  rt.  tr } 


1  0 

0 

0  0-1 

o 

1 

0  0 

-1 

f  = 

0  1  0 

t  = 

-10  0 

0  -1 

0_ 

-1  0  0 

o 

o 

On  cubelets  4,  5,  6,  and  7,  the 
subgroup  generated  is 
{ i,  r,  f,  t,  rf,  rt,  fr,  ft,  tr,  tf,  rfr,  rft,  rtr,  rtf, 
frt.ftr,  ftf,  trf,  tfr,  rfrt,  rftr,  rftf,  rtrf,  rtfr) 
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Using  each  set  of  matrices  as  generators,  we  get  two  subgroups  of  actions,  the  second  of  which  is  a 
faithful  representation  of  the  whole  group.  The  first  subgroup  moves  cubies  1,  2,  and  3,  while  holding  4, 
5,  6,  and  7  in  position.  The  second  subgroup  moves  cubies  4,  5,  6,  and  7  while  holding  1,  2,  and  3  in 
their  positions.  We  then  repeat  this  procedure  on  the  first  set  of  actions  to  obtain  a  full  set  of  prime  fac¬ 
tors  of  the  group. 


These  factors  can  be  assembled  in  different  ways  to  form  serial  algorithms.  There  is  more  than  one 
way  to  decompose  this  group,  analogous  to  the  different  ways  of  listing  the  prime  factors  of  a  number. 


Five  serial  algorithms  are  obtained  in  this  way.  We  examine  two  of  them. 

R  =  {i,  frtr}  °  {i,  rtft}  °  {i,  t}  0  {i,  r,  f} 


One  of  {  i,r,f  }  brings  cubelet  3  into  cubicle  3. 

One  of  {  i,t  }  brings  cubelet  1  and  2  into  their  places. 

One  of  {  i,  rtft  }  brings  (4,6)  and  (5,7)  in  the  proper  planes  ('the  front  face  looks  right'). 

One  of  {  i,  frtr  }  finishes  the  Cube. 

The  above  figure  is  read  right-to-left;  shaded  cubicles  have  been  solved,  “i”  denotes  the  identity 
(null)  action.  The  average  number  of  moves  required  to  solve  the  Cube  in  this  way  is  5.17. 

We  see  that  there  are  two  “subspaces”  of  cubies  in  this  Cube:  one  consisting  of  cubicles  1,2,3  and  8, 
and  the  other  of  cubicles  4,5,6,  and  7.  For  each  subspace,  there  is  a  subprogram  that  interchanges  the 
cubies  in  its  cubicles  while  holding  the  cubies  in  the  other  subspace  invariant.  We  are  thus  justified  in 
thinking  of  these  two  sets  of  cubicles  as  independent  subspaces.  Recognizing  the  symmetries  that  char¬ 
acterize  such  subspaces  is  essential  for  synthesizing  such  algorithms. 

Each  step  in  the  algorithm  brings  a  subset  of  cubies  to  its  goal  value.  Subsequent  steps  hold  that  pat¬ 
tern  of  cubies  invariant.  In  this  way,  a  divide-and-conquer  algorithm  is  synthesized.  For  example,  the 
first  step  solves  cubicle  3.  Knowing  the  colors  of  the  solved  cubicle  8,  we  know  the  colors  of  cubicle  3  - 
it  has  the  same  color  as  the  bottom  of  cubicle  8,  and  two  new  colors.  There  is  only  one  such  cubie,  and  it 
must  be  in  one  of  three  locations:  in  its  goal  position,  or  in  cubicle  1  or  2.  The  system  need  only  examine 
those  3  values  to  determine  what  action  to  take.  Once  cubicle  3  is  solved,  it  need  not  be  examined  again. 
The  system  next  solves  cubicles  1  and  2;  it  need  only  examine  either  position  to  see  if  the  proper  cubie 
is  there;  if  so,  it  does  nothing,  otherwise,  it  twists  the  top.  Finally,  the  system  uses  the  appropriate 
sequence  of  actions  to  solve  the  remaining  four  cubicles,  by  first  examining  the  front  face  to  see  if  it  is  a 
uniform  color,  and  then  examining  the  top  or  right  face  to  see  if  it  is  of  uniform  color. 
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We  have  transformed  the  global  7-dimensional  description  of  the  Cube  into  a  composition  of  local 
descriptions,  each  characterized  by  a  set  of  symmetries.  This  decomposition  has  average  cost  of  5.17, 
whereas  an  optimal  solution  is  of  average  length  2.46,  There  are  better  decompositions.  We  now  examine 
the  best  decomposition.: 

R  =  {i,  t}  °  {i,  r,  rt}  °  {i,  f,  ft,  fr} 

Serial  Algorithm  2: 

One  of  {  i,f,fr,ft  }  brings  cubelet  6  into  cubicle  6. 

One  of  {  i,r,rt  }  brings  3,7  in  place  (bottom  layer  correct). 

One  of  {  i,  t}  finishes  the  Cube. 

The  average  number  of  moves  to  solve  the  Cube  using  this  decomposition  is  2.75. 

Each  decomposition  can  be  thought  of  as  a  coordinate  system  whose  origin  is  the  goal  state.  For 
example,  the  second  serial  algorithm  can  be  thought  of  as  a  3-dimensional  coordinate  system  (a,b,c) 
where  a  is  in  { i,f,ft,fr } ,  b  is  in  { i ,r,rt } ,  and  c  is  in  { i,t } . 

From  the  Cube  example,  we  see  that  we  can  view  a  task  representation  as  a  coordinate  system  whose 
axes  are  the  components  of  the  task.  Using  group  representation  theory,  we  represent  the  actions  as 
matrices.  Changing  the  basis  so  that  invariant  eigenvectors  are  axes  re-expresses  the  task  space  so  that 
subtasks  are  coordinate  subspaces,  thereby  identifying  a  good  decomposition.  The  divide-and-conquer 
algorithm  transforms  the  group  of  the  problem  into  a  product  of  smaller  groups  (a  composition  series). 
This  is  done  by  factoring  the  group  at  each  step  by  an  irreducible  normal  subgroup,  yielding  a  quotient 
group.  This  procedure  is  similar  to  that  used  in  Galois  theory,  in  which  an  equation  is  solvable  by  extrac¬ 
tion  of  roots  iff  its  group  of  symmetries  is  solvable.  However,  the  situation  with  domain  theories  is  more 
general,  in  that  a  divide-and-conquer  algorithm  can  be  generated  even  if  the  transformation  group  is  not 
solvable,  because  all  that  we  require  is  that  the  decomposition'  yield  components  that  are  small  enough 
that  global  search  can  be  applied  efficiently  to  each  of  them. 


The  Mathematics  of  Local  Symmetries:  Inverse  Semigroups 

The  previous  section  illustrates  how  the  analysis  of  global  symmetries  can  be  useful  in  synthesis. 
However,  global  symmetries,  and  in  general  global  properties,  are  not  sufficiently  general.  We  must  also 
examine  local  symmetries,  which  are  symmetries  between  parts  of  a  system,  rather  than  of  the  whole 
system.  Local  symmetries  correspond  to  local  invariants,  which  are  predicates  that  hold  for  some  part  of 
a  system,  but  not  necessarily  for  the  whole  system.  An  example  is  a  loop  invariant  in  a  program,  which 


holds  during  the  execution  of  the  loop,  but  not  necessarily  elsewhere.  This  is  the  sort  of  invariant  more 
often  encountered  in  computing. 

To  handle  local  symmetries,  we  use  a  more  general  theory  than  group  theory:  the  theory  of  inverse 
semigroups.  A  semigroup  is  a  closed,  associative  system,  and  an  inverse  semigroup  has  the  additional 
property  that  every  action  is  invertible.  The  difference  between  a  group  and  an  inverse  semigroup  is  that 
the  transformations  in  the  group  must  be  globally  defined  (they  correspond  to  global  symmetries) 
whereas  those  in  the  inverse  semigroup  can  be  defined  on  only  part  of  the  space,  and  are  thus  partial 
functions  on  the  space.  Inverse  semigroups  are  thus  more  appropriate  for  reasoning  about  programming 
constructs,  e.g.,  rules,  which  can  be  defined  only  on  some  variable  bindings. 

In  physical  theories,  space-time  is  represented  in  terms  of  a  coordinate  system.  Invariance  under 
translation  in  time  or  space  then  becomes  invariance  under  coordinate  change.  Inverse  semigroups  pos¬ 
sess  a  similar  notion  of  coordinate  system,  and  we  use  this  in  the  same  way. 

We  consider  a  task  theory  M  =  (Q,,  A,  8)  consisting  of  a  set  A  of  actions  defined  as  partial  functions 
on  a  state  space  Q,  together  with  a  mapping  8:  Q  x  A  Q  that  defines  the  state  transitions.  We  are  not 
concerned  with  the  syntactic  details  of  the  encoding  of  the  actions  A,  but  rather  with  which  actions 
should  be  labeled  the  same  and  should  therefore  be  considered  instances  of  a  common  abstraction.  In 
other  words,  we  are  concerned  with  the  algebraic  structure  of  A.  A  task  is  specified  by  a  pair  (i,g),  where 
i:  1  — >  Q  maps  a  one-element  set  into  Q  identifying  an  initial  state,  and  g:  1  Q  identifies  a  desired 
state.  Without  loss  of  generality,  we  can  restrict  our  attention  to  semigroups  of  partial  1-1  functions  on 
the  state  space  Q  (Howie,  1976).  This  formalism  is  extremely  general.  It  encompasses  nondeterministic 
systems  and  concurrent  systems. 

The  description  of  a  system  for  the  synthesis  of  a  plan  (or  control)  for  M  differs  from  the  description 
of  M  in  at  least  one  essential  way:  the  process  of  planning  an  action  is  reversible  whether  or  not  the 
action  itself  is  reversible  (assuming  the  synthesizing  system  can  backtrack.)  Thus,  the  process  of  synthe¬ 
sizing  plans  can  be  described  by  a  theory  whose  actions  form  an  appropriate  inverse  semigroup  contain¬ 
ing  the  original  actions  together  with  newly  added  inverses  corresponding  to  backtracking.  As  this  paper 
is  primarily  concerned  with  system  synthesis,  we  will  analyze  only  inverse  semigroups  in  the  remainder 
of  this  paper. 

To  analyze  the  structure  of  such  a  semigroup  of  transformations,  a  usual  step  is  to  examine  Green's 
relations  (Lallement,  1979).  Green’s  equivalence  relations  are  defined  as  follows  for  any  semigroup  S: 

a  R  b  iff  aS1  =  bS1  a  L  b  iff  S‘a  =  S!b  a  J  b  iff  S^S1  =  S'bS1 
H  =  RnL  D  =  RL 

where  S!  denotes  the  monoid  corresponding  to  S  (S  with  an  identity  element  1  adjoined).  Intuitively, 
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we  can  think  of  these  relations  in  the  following  way:  aRb  iff  for  any  plan  that  begins  with  “a”,  there 
exists  a  plan  beginning  with  “b”  that  yields  the  same  behavior;  aLb  iff  for  any  plan  that  ends  with  “a”, 
there  exists  a  plan  ending  with  “b”  that  yields  the  same  behavior;  aHb  indicates  functional  equivalence, 
in  the  sense  that  for  any  plan  containing  an  “a”  there  is  a  plan  containing  “b”  that  yields  the  same  behav¬ 
ior;  two  elements  in  different  D-classes  are  functionally  dissimilar,  in  that  no  plan  containing  either  can 
exhibit  the  same  behavior  as  any  plan  containing  the  other. 

Green’s  relations  organize  the  actions  of  a  transformation  semigroup  according  to  their  functional 
properties,  and  organize  the  states  according  to  the  behaviors  that  can  be  exhibited  from  them.  This 
allows  us  to  define  a  basic  local  neighborhood  of  a  semigroup: 

Definition.  Given  a  transformation  semigroup  S  =  (Q,  A)  and  a  point  q  e  Q  ,  a  D-class  of  S  contain¬ 
ing  an  action  whose  domain  includes  q  is  called  a  basic  local  neighborhood  of  q. 

Each  basic  local  neighborhood  in  state  space  consists  of  behaviorally  similar  states.  If  the  agent’s 
perceptual  capabilities  are  not  sufficiently  precise  to  distinguish  different  neighborhoods,  then  it  cannot 
plan  and  move  effectively  in  the  space.  A  state  q  may  be  in  more  than  one  basic  local  neighborhood. 
Clearly,  every  element  of  Q  is  in  at  least  one  basic  local  neighborhood.  The  set  of  all  basic  local  neigh¬ 
borhoods  forms  a  neighborhood  base  for  a  topology  on  Q;  however,  this  direction  will  not  be  pursued  in 
this  paper,  so  we  will  often  just  refer  to  basic  local  neighborhoods  as  neighborhoods. 

Utilizing  Green’s  relations,  Lallement  defines  a  local  coordinate  system  for  a  neighborhood: 

Definition.  Let  D  be  a  D-class  of  a  semigroup,  and  let  H^p  (  leA.peP  )be  the  set  of  H-classes 
contained  in  D  (indexed  by  their  L-class  and  R-class).  A  coordinate  system  for  D  is  a  selection  of  a 
particular  H-class  H0  contained  in  D,  and  of  elements  q^p,  q'  xp>  r^p,  r’^p  e  S1  with  X  e  A,  p  e  P  such 
that  the  mappings  x  ->  qxpxr^p  and  y  q’xpyr’xp  are  bijections  from  H0  to  H^p  and  from  H^p  to 
H0,  respectively.  A  coordinate  system  for  D  is  denoted  by  [H0;{(qxp.  q'  xp.  D,p>  r'  ^p):  X  e  A,  p  e  P}]. 
There  may  be  more  than  one  local  coordinate  system  for  a  D-class.  Each  coordinate  system  gives  a 
matrix  representation  in  much  the  same  way  that  a  coordinate  system  in  a  vector  space  gives  a  matrix 
representation,  permitting  us  to  change  coordinates  within  a  local  neighborhood  by  performing  a  simi¬ 
larity  transformation  (inner  automorphism)  in  the  usual  way  (the  reader  is  referred  to  Lallement  for 
details). 

Each  local  coordinate  system  within  the  semigroup  expresses  a  distinct  syntactic  labeling  of  a  sub¬ 
system,  as  we  can  choose  a  point  in  the  neighborhood  to  be  the  “origin”,  and  label  points  in  the  neigh¬ 
borhood  according  to  the  actions  that  map  the  origin  to  them.  In  addition,  each  coordinate  system  can  be 
used  to  create  a  matrix  representation  for  the  semigroup,  in  much  the  same  way  that  a  coordinate  system 


for  a  vector  space  yields  matrices  for  the  linear  transformations  of  the  vector  space.  The  reader  is 
referred  to  Lallement  (1974)  for  details. 


Example:  The  Towers  of  Hanoi 


Let  us  number  the  nine  states  of  the  2-disk  Towers  of  Hanoi  as  follows: 


Let  the  two  possible  actions  be  denoted  by  “x”  and  “y”. 

x=123456789  y=123456789 

231564897  4  83 

“x”  moves  the  small  disk  right  one  peg  (wrapping  around  from  peg  3  to  peg  1),  and  “y”  moves  the 
large  disk  one  peg  to  the  left  (wrapping  around  from  peg  1  to  peg  3).  In  the  figure,  “x”  is  shown  by  nar¬ 
row,  counterclockwise  arrows,  and  “y”  is  shown  by  thick,  counterclockwise  arrows.  These  two  actions 
generate  a  semigroup  of  31  distinct  partial  functions  on  the  states.  Green  s  relations  for  this  semigroup 


are: 


D2 


D1 

X,  XX,  XXX 

xyx 

xyxxyx 

xyxxyxxyx 

xy 

xyxxy 

xyxxyxxy 

xyxx 

xyxxyxx 

xyxxyxxyxx 

xxyx 

xxyxxyx 

xxyxxyxxyx 

xxy 

xxyxxy 

xxyxxvxxy 

xxyxx 

xxyxxyxx 

xxyxxyxxyxx 

yx 

yxxyx 

yxxyxxyx 

y 

yxxy 
yxxyxxy  • 

yxx 

yxxyxx 

yxxyxxyxx 
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There  are  three  D  classes,  shown  as  the  three  separate  large  boxes.  In  each  D  class,  the  R  classes  are 
rows  and  the  L  classes  are  columns,  and  they  intersect  in  the  small  boxes,  which  are  H  classes.  Note  that 
DO  and  D1  consist  of  only  one  R  class  and  one  L  class,  and  hence  one  H  class.  The  idempotents  are  in 
bold  type. 

There  are  no  nontrivial  inner  automorphisms  of  DO  and  Dl.  The  group  of  inner  automorphisms  of  D2 
is  a  cyclic  group  of  order  three.  These  coordinate  transformations  are  global  within  D2,  but  are  local  in 
the  semigroup.  These  inner  automorphisms  are  calculated  by  the  matrix  techniques  explained  by  Lalle- 
ment  (1979).  A  generator  for  this  group  is  the  automorphism  that  maps  xyx  to  xxy,  xxy  to  yxx,  and  yxx 
to  xyx.  Factoring  D2  by  this  map  gives: 

Define  z  =  case  {  little  disk  left  of  large  disk:  xxy 
little  disk  on  large  disk:  xyx 

little  disk  right  of  large  disk:  yxx  } 

where  z  is  a  new  symbol. 

x  is  as  before,  and  z  moves  both  disks  left  one  peg.  z  is  a  macro-action,  which  is  implemented  as  a 
disjunction  of  sequences  of  actions,  x  and  z  are  independent  controls;  x  solves  the  position  of  the  small 
disk,  and  z  solves  the  big  disk,  x  does  not  change  the  position  of  the  big  disk,  and  z  does  not  change  the 
relative  positions  of  the  disks.  The  disks  can  be  solved  in  either  order,  as  the  new  representation  is  an 
abelian  group.  This  new  representation  captures  the  important  property  of  the  Towers  of  Hanoi  task:  the 
disks  can  be  solved  in  any  order.  In  the  original  theory,  this  property  was  obscured  by  details  of  the 
implementations  of  the  disk  moves. 

This  decomposition  applies  not  only  to  the  two-disk  problem,  but  to  all  Towers  of  Hanoi  problems. 
This  is  seen  by  forming  free  products  of  the  semigroup  with  itself,  amalgamating  the  coordinate  axes  in 
all  possible  ways.  There  are  three  free  products  of  this  semigroup  with  itself  that  amalgamate  coordi¬ 
nates:  D2  can  be  identified  with  itself,  Dl  with  itself,  and  D2  with  Dl  (by  mapping  xxy,  xyx,  and  yxx  to 
x).  These  correspond  to  identifying  the  moves  of  the  larger  disk  of  one  copy  of  the  semigroup  with  the 
moves  of  the  larger  disk  in  another  copy,  identifying  the  moves  of  the  smaller  disk  with  the  moves  of  the 
smaller  disk,  and  identifying  the  moves  of  the  larger  disk  with  the  moves  of  the  smaller  disk,  respec¬ 
tively.  The  result  of  this  construction  is  the  three-disk  Towers  of  Hanoi,  which  is  viewed  as  three  copies 
of  the  two-disk  task  running  concurrently. 
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From  the  logical  perspective,  this  can  be  viewed  as  composing  three  copies  of  a  theory  for  the  two- 
disk  Towers  of  Hanoi,  to  yield  a  valid  theory  for  the  three-disk  problem.  Copies  of  the  theory  are  joined 
by  unifying  variables  between  theories.  Two  copies  joined  at  the  middle  disk  will  not  suffice,  as  then  the 
largest  disk  could  be  placed  on  the  smallest  disk.  A  third  copy  of  the  theory  prohibits  this.  The  purpose 
of  computing  a  coordinate  system  is  that  the  coordinate  axes  determine  what  to  unify. 

The  interaction  of  these  three  smaller  tasks  yields  a  set  of  relations.  For  example,  consider  moving 
the  middle  disk  one  peg  to  the  right.  When  considering  this  disk  as  the  larger  disk  in  the  task  consisting 
of  the  upper  two  disks,  this  move  is  xyxxy  (in  the  state  when  all  disks  are  on  the  left  peg),  yxxyx  (when 
the  smallest  disk  is  to  the  right  of  the  middle  disk),  or  xxyxxyxx.  On  the  other  hand,  when  considering 
this  disk  as  the  smaller  disk  in  the  task  consisting  of  the  lower  two  disk,  this  move  is  x  (in  the  other  copy 
of  the  semigroup).  This  means  that  we  add  the  relation  xyxxy  =  yxxyx  =  xxyxxyxx  to  the  presentation  of 
the  semigroup. 

This  method  of  composing  larger  tasks  from  smaller  ones  guarantees  that  this  decomposition  general¬ 
izes  to  any  Towers  of  Hanoi  problem  with  n  disks,  by  considering  a  coordinate  system  for  the  three-disk 
task,  and  forming  free  products  with  amalgamation  in  the  same  manner  as  before.  These  free  products 
with  amalgamation  will  always  reduce  to  free  products  with  amalgamation  of  coordinate  systems  of  the 
two-disk  task,  so  that  by  induction  the  properties  of  the  two-disk  task  determine  the  properties  of  all 
Towers  of  Hanoi  tasks. 

Example:  n-Queens 

The  n-Queens  task  is  to  place  n  queens  on  an  nxn  chessboard  so  that  no  two  queens  are  on  the  same 
row,  column,  or  diagonal.  We  are  posed  the  task  of  enumerating  all  solutions  for  a  given  value  of  n. 
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Q 

2 

Q 

3 

Q 

4 

Q 

1 

2 

3 

4 

A  direct  way  to  construct  a  theory  of  this  domain  that  specifies  the  legal  solutions  is  to  specify  a 
solution  as  satisfying  a  predicate  that  is  the  conjunction  of  the  goal  conditions:  no-two-queens-on-same- 
row  &  no-two-queens-on-same-column  &  no-two-queens-on-same-up-diagonal  &  no-two-queens-on- 
same-down-diagonal.  In  Smith  (1990),  it  is  shown  how  this  domain  theory  can  be  combined  with  a  the- 
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ory  of  global  search  to  derive  a  correct  high-level  program.  However,  even  with  efficient  heuristics,  the 
search  space  grows  exponentially  with  the  size  of  the  problem.  For  example,  in  the  4-queens  problem 
shown  above,  there  is  no  solution  if  we  place  the  first  queen  in  the  upper  left-hand  corner  or  lower-left 
hand  corner,  but  the  system  cannot  know  this  until  it  has  searched  all  ways  of  placing  the  remaining 
queens.  This  exponential  growth  in  the  number  of  possibilities  is  the  well-known  Achilles’  heel  of  prob¬ 
lem  solving,  as  it  precludes  the  efficient  solution  of  very  large  problems.  Global  search  cannot  be  used  to 
enumerate  the  solutions  to  the  million-queens  problem. 

Effective  search  heuristics  are  known  for  finding  a  single  solution  to  a  problem  of  that  magnitude,  but 
they  cannot  efficiently  enumerate  all  solutions.  This  prevents  consideration  of  the  space  of  solutions, 
and  in  particular  prevents  optimal  problem  solving.  More  seriously,  the  heuristic  approach  requires  engi¬ 
neering  a  new  heuristic  for  each  problem  class.  This  does  not  even  partially  solve  the  designer  s  problem 
of  finding  an  efficient  algorithm;  it  merely  transforms  it  into  the  problem  of  finding  an  efficient  heuris¬ 
tic. 

What  the  designer  needs  is  a  method  for  finding  good  decompositions  of  the  problem.  A  good  decom¬ 
position  of  the  problem  has  at  least  the  following  two  properties:  it  splits  the  problem  into  components 
such  that  solutions  within  the  components  compose  to  form  solutions  to  the  whole  problem,  and  it 
applies  to  all  problems  in  the  class. 

The  two  forms  of  decomposition  usually  implemented  in  systems  such  as  KIDS  (Smith,  1990)  are 
first-rest  decomposition  and  half-half  decomposition.  In  first-rest  decomposition,  the  first  part  of  the 
problem  description  is  solved  and  its  solution  is  combined  with  the  solution  of  the  rest  (this  constituting 
a  recursive  call  to  first-rest  decomposition.)  In  half-half  decomposition,  the  problem  description  is  split 
roughly  in  half,  and  solutions  of  each  half  are  combined  (again  leading  to  recursive  calls.)  Mergesort  is 
a  typical  example  of  half-half  decomposition,  while  bubble  sort  is  an  example  of  first-rest  decomposi¬ 
tion. 

These  forms  of  decomposition  are  not  generally  applicable.  For  example,  neither  of  these  forms  of 
decomposition  applies  to  the  n-queens  problem.  In  first-rest  decomposition,  we  would  place  one  queen, 
then  place  the  remaining  n-1  queens.  But  as  we  saw  above,  if  we  place  the  first  queen  in  the  wrong  place 
(a  corner),  the  1-queen  solution  will  not  compose  with  any  solution  to  the  n-1  queens  to  give  a  valid  n- 
queen  solution.  Similarly,  half-half  decomposition  would  find  any  solution  for  placing  2  queens,  and 
compose  it  with  another  2-queen  solution  (which  could  be  a  copy  of  itself.)  This  need  not  be  a  solution 
to  the  4-queen  problem. 

The  next  figure  shows  the  hierarchy  of  local  symmetries  for  the  solution  to  the  4-queens  problem. 
The  top-level  local  symmetry  is  that  of  the  3x4  subproblems.,  but  the  four  3x4  subproblems  in  the  4x4  are 
mapped  to  each  other  only  by  the  cyclic  global  symmetry  of  order  4.  The  first  new  local  symmetries  arise 
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from  considering  the  2x3  subproblem  that  contains  2  queens.  The  four  instances  of  this  subproblem  are 
of  course  also  mapped  to  each  other  by  the  cyclic  global  symmetry,  but  also  map  to  themselves  by  a  local 
symmetry  of  order  2.  The  next  figure  shows  these  four  instances,  together  with  the  1x2  subproblems  that 
arise  from  intersecting  the  2x3  subproblems: 


As  in  the  other  problems  considered,  we  construct  larger  instances  of  the  n-queens  problem  from  cop¬ 
ies  of  the  4x4  solution,  amalgamating  the  local  symmetries.  For  example,  the  6-queens  solution  can  be 


constructed  from  four  copies  of  the  4-queens  solution  by  amalgamating  2x3  subproblems: 


The  6-queens  solution  can  be  viewed  as  three  4-queens  solutions  in  this  way. 

Stated  another  way,  the  construction  of  the  6-queens  solution  from  multiple  4-queens  solutions  can 
be  thought  of  as  follows:  take  2  copies  of  the  4-queens  solution  and  amalgamate  a  2x3  subproblem  in 
one  copy  with  a  2x3  in  the  other.  There  are  6  distinct  queens.  The  four  queens  in  the  first  copy  are  guar¬ 
anteed  to  be  on  their  own  rows,  columns,  and  diagonals,  and  the  four  queens  in  the  second  copy  are  sim¬ 
ilarly  guaranteed  to  satisfy  the  solution  conditions.  But’  we  are  not  guaranteed  that  the  two  non- 
amalgamated  queens  in  one  copy  are  on  distinct  rows,  columns,  or  diagonals  from  the  two  non-amalgam- 
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ated  queens  in  the  other  copy.  It  is  necessary  that  those  four  queens  also  satisfy  the  solution  conditions, 
i.e.,  we  need  them  to  be  a  4-queens  solution  by  themselves.  Therefore,  composing  three  4-queens  solu¬ 
tions  in  this  way  is  both  necessary  and  sufficient  to  give  a  6-queens  solution,  which  implies  that  all  6- 
queens  solutions  are  constructed  in  this  way. 

Thus,  we  can  construct  n-queens  solutions  from  smaller  solutions.  It  is  not  necessary  to  search  a 
space  containing  non-solutions.  Instead,  analysis  of  the  structure  of  the  base  case  reveals  both  the 
decompositions  of  the  base  case  and  the  inductive  definition  of  the  class  of  problems  with  the  same 
structure.  Ideally,  a  system  for  software  design  should  have  the  capability  to  perform  this  analysis,  to 
synthesize  programs  that  construct  the  solution  space  directly  in  this  fashion  whenever  possible. 


Summary 

A  good  formulation  of  a  domain  theory  omits  irrelevant  detail  and  identifies  the  essential  structure  of 
the  domain.  In  particular,  it  identifies  the  subgoal  decompositions  that  lead  to  efficient  solutions.  This 
paper  has  described  a  research  project  exploring  the  mathematical  foundations  of  theory  formulations. 
An  analogy  is  constructed  between  state  spaces  and  physical  space.  In  this  analogy,  a  domain  theory 
gives  the  laws  of  motion  in  the  state  space,  and  a  coordinate  system  gives  a  formulation  of  a  theory.  The 
formulation  of  theories  is  then  governed  by  the  established  principles  of  symmetry  and  invariance.  The 
symmetries  of  state  spaces  of  base  cases  can  be  used  to  formulate  invariants  for  recursive  programs  that 
solve  an  entire  class  of  tasks.  The  goal  of  this  research  is  a  comprehensive  theory  of  the  symmetries  and 
invariants  of  programs,  and  an  implemented  system  for  program  synthesis  based  on  the  theory. 
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Abstract 

A  low  complexity  space-time  adaptive  processing  (STAP)  scheme,  called  the  adaptive 
displaced  antenna  (ADPCA)  technique  is  analyzed.  Conditions  under  which  this  scheme  is 
optimum  are  given  for  the  case  of  observations  with  a  known  covariance  matrix.  These 
conditions  can  be  satisfied  if  exactly  two  pulses  are  used  in  the  ADPCA  scheme,  but  they 
can  not  be  satisfied  for  ADPCA  schemes  which  use  more  than  two  pulses.  An  interesting 
interpretation  of  the  ADPCA  scheme  is  provided  which  may  explain  its  good  performance 
for  some  airborne  surveillance  radar  problems.  For  some  cases  with  noise  and  ground  clutter 
only,  the  ADPCA  scheme  approximates  a  pulse  canceling  scheme  applied  to  optimally  pro¬ 
cessed  pairs  of  pulses.  The  approximation  is  exact  for  cases  where  the  clutter  is  produced 
by  ground  scatterers  with  zero  azimuth  angle.  In  other  cases  considered  the  approximation 
error  was  found  to  be  quite  small  for  cases  where  the  clutter  is  produced  by  ground  scatter¬ 
ers  with  any  azimuth  angle.  The  detection  performance  was  analyzed  for  a  case  where  the 
training  data  does  not  include  the  effects  of  a  scatterer  that  produces  clutter  in  the  range 
cell  under  test.  Here  the  ADPCA  scheme  outperforms  scheme  which  is  optimum  for  the  case 
of  a  known  covariance  matrix. 


An  Analysis  of  Adaptive  DPCA 
Rick  S.  Blum 


4-1  Introduction 

The  adaptive  displaced  phase  centered  antenna  (ADPCA)  technique  is  a  low  complexity 
alternative  to  joint-domain  optimum  space-time  adaptive  processing  (STAP)  [1].  It  can 
provide  benefits  for  cases  where  the  training  data  is  missing  some  details  of  the  interference 
environment.  ADPCA  gets  its  name  from  the  displaced  phase  centered  antenna  (DPCA) 
technique,  which  has  been  extremely  useful  in  reducing  the  deleterious  effects  of  platform 
motion  in  radar  systems  for  a  number  of  years  [2,  3],  but  there  is  not  much  similarity  in  these 
two  techniques.  One  similarity  is  that  both  techniques  were  developed  to  reduce  the  ground 
clutter  seen  by  airborne  radar.  The  basic  DPCA  technique  [4,  5]  involves  either  physically 
or  electronically  displacing  the  receive  antenna  to  compensate  for  platform  motion.  Here  we 
analyze  the  ADPCA  technique  which  is  part  of  the  Rome  Laboratory  Space-Time  Adaptive 
Processing  Algorithm  Development  Tool  (RLSTAP/ADT)  1 . 

In  the  ADPCA  technique  we  compare  the  test  statistic 

T(X)  =  |V(S')hR-1V(V)|  (1) 

to  a  threshold  to  make  a  decision  where  V(X )  is  the  vector  of  complex  data  samples  observed, 
V(S)H  is  the  conjugate  transpose  of  the  steering  vector  V(S),  and  R  is  the  estimated  noise- 
plus-clutter  covariance  matrix.  If  T(X)  is  larger  than  the  threshold  then  a  decision  is  made 
that  a  target  is  present.  The  threshold  that  T(X)  is  compared  to  can  be  fixed  or  it  can 
be  made  adaptive  to  provide  a  constant  false  alarm  rate  (CFAR).  We  do  not  discuss  a 
CFAR  scheme  here,  but  we  refer  the  reader  to  [7]  which  discusses  an  approach  for  achieving 
CFAR  when  using  a  test  statistic  which  is  similar  to  T(X).  Assume  that  samples  are  taken 
from  from  M  different  pulse  returns  received  at  N  antenna  array  elements.  Each  return 

1see  http:www.rl.af.mil:8001/Technology/Demos/STAP  for  more  information  on  RLSTAP. 
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is  assumed  to  contain  a  possible  signal  in  additive  clutter-plus-noise.  The  possible  signal 
corresponds  to  a  return  from  a  transmitted  pulse  and  this  signal  indicates  the  presence  of 
a  target  at  a  particular  range.  For  simplicity,  we  assume  periodically  transmitted  pulses. 
Denote  the  observation  corresponding  to  the  jth  pulse  at  the  kth  antenna  element  as  Xkj.  Each 
observation  is  a  complex  number  corresponding  to  the  in-phase  and  quadrature  components 
of  the  complex  envelope  of  the  matched  filtered  (matched  to  the  pulse  waveform)  received 
signal.  The  observations  are  ordered  as 

V(X)  =  (xn,  x2i,  •  •  •  >  xni,  Z12,  •  •  •  xnm)T  (2) 

where  aT  denotes  the  transpose  of  the  vector  a.  Define  Xkj  =  Xkj  —  E[xkj\,  k  =  1 ,N,j  = 
1, ,  M.  The  estimated  noise-plus-clutter  covariance  matrix  R,  whose  (k  —  j) th  entry  we 
denote  by  rjy,  is  an  estimate  of 

E[{V{X)  -  E{V{X))){V{X)  -  E(V(X))h ]  = 

^[|xn|2]  E[x  11^21]  •••  E[xhXhM\ 

E[x2ix*n]  E[\x21\2}  ...  E[x21x*NM\ 

E \%NMX  11]  E[xpfMX 21]  . . .  E[|xwm|  ] 

which  is  the  true  covariance  matrix.  The  estimate  is  based  on  a  set  of  reference  data,  typically 
chosen  from  the  surrounding  range  cells. 

Finally,  if  the  array  is  looking  broadside  then  the  steering  vector  for  the  ADPCA 
technique  is  composed  of  the  binomial  coefficients,  with  each  coefficient  repeated  for  the 
number  of  antennas  and  where  every  other  coefficient  is  altered  in  sign  (start  with  positive). 
As  a  particular  example,  we  have 

V(S)  =  (1, 1, 1, 1,  -2,  -2,  -2,  -2, 1, 1, 1,  If  (4) 

for  a  three  pulse,  four  antenna  case.  If  R~l  is  an  identity  matrix  then  application  of  this 
steering  vector  (as  in  (1))  has  a  simple  interpretation.  At  each  antenna,  subtract  the  ampli¬ 
tude  of  neighboring  pulses.  Next  subtract  neighboring  results  and  repeat  this  process  until 
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only  a  single  output  is  obtained.  Finally  the  outputs  from  each  antenna  are  summed.  The 
processing  at  a  single  antenna  is  illustrated  in  Figure  1  for  a  three  pulse  case.  This  is  clearly 
a  type  of  multiple  pulse  canceling  at  each  antenna. 


Pulse  1 


Figure  1:  Block  diagram  of  pulse  differencing  at  one  antenna. 


More  generally,  the  elements  of  the  steering  vector  in  (4)  are  multipled  by  the  appro¬ 
priate  member  of  an  exponential  progression  to  steer  the  beam  in  the  direction  of  the  target. 
To  be  more  explicit,  for  the  rest  of  this  paper  we  consider  a  uniformly  spaced  linear  array 
antenna  as  described  in  [5].  We  assume  the  array  is  oriented  along  the  x-axis  and  consider  a 
target  at  azimuth  angle  cj)  and  elevation  6.  To  steer  the  beam  in  the  target  direction  [5],  we 
use 

v(s)  =  u,pi)...,p 

for  a  three  pulse,  N  antenna  case  where 

pv  =  exp(j'27ri/t), 
dcos  6  sin  (f> 


(6) 

(7) 
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In  (6)  and  (7)  d  is  the  distance  between  the  equally  spaced  elements  in  the  line  array  and  A 
is  the  radar  wavelength. 


4-2  Optimality  of  ADPCA  for  Two  Pulse  Case 

The  Neyman-Pearson  optimum  STAP  scheme  (assuming  the  estimate  of  R  is  exactly  equal 
to  the  true  covariance  matrix)  [6],  under  the  typical  assumptions  of  a  known  signal  with 
unknown  amplitude  and  uniform  phase  angle  which  is  observed  in  additive  Gaussian  noise- 
plus-clutter.  uses  a  test  statistic  which  is  of  the  form  of  (1)  with  F(iS')  a  scalar  multiple  of 
the  complex  envelope  of  the  received  signal  to  be  detected.  Thus  y(S')  should  be  a  scalar 
multiple  of  [6] 

Pl/PljJ  >  PuPu)  >  •  •  •  )  Pv  Pu  > 


•  •  •  ) 


0  M-l  1  M-l 
PuPu  >  PuPu  ' 


JV-l  M-1\T 
,Pu  Pu  ) 


(8) 


for  the  N  element,  M  pulse  case,  where 

Pu  =  exp(;27rcht),  (9) 

2  vT 

v  is  the  velocity  of  the  antenna  array  and  T  is  the  pulse  repetition  interval.  Using  the 
test  statistic  in  (1)  with  (8)  for  the  case  where  R  is  a  maximum  likelihood  estimate  of  the 
covariance  matrix  is  sometimes  called  the  sample  matrix  inversion  scheme.  In  general,  we 
see  from  (5)  and  (8)  that  the  ADPCA  technique  is  generally  different  from  the  optimum 
STAP  scheme.  Now  let  us  show  one  case  where  they  are  the  same. 

Now  consider  a  case  where  only  two  pulses  are  used  with  N  antenna  elements.  Further, 
assume  that  the  data  received  by  the  array  is  preprocessed  to  simulate  controlled  platform 


motion  so  that  it  appears  that  v  =  A / (4 T)  in  (10).  Under  these  conditions  we  find  that 
Cjt  =  1  and  thus  the  optimum  steering  vector  for  this  case  (8),  which  is 


v(S)  =  (pI,pI,...,p* 


N- 1 


-pl 


,  ~PV 


T 


(11) 


is  exactly  equal  to  the  ADPCA  steering  vector  for  this  case. 


4-3  Optimality  of  ADPCA  in  General  Case 

One  might  ask  “for  what  other  cases  is  the  ADPCA  scheme  described  here  is  identical  to 
the  optimum  STAP  scheme  in  (8)”.  From  (1)  this  requires  that 

Vd{S)hR-1  =  CVoiSfR-1  (12) 

where  Vd(S)  is  the  ADPCA  steering  vector  (as  in  (5))  and  Vo(S)  is  the  steering  vector  for 
the  optimum  STAP  scheme  (as  in  (8))  and  C  is  some  constant  multiplier  which  can  take  on 
any  finite  complex  value.  Assuming  R~1  must  have  an  inverse  R  we  can  multiply  both  sides 
of  (12)  on  the  right  by  R  to  obtain 

VD(S)  =  CVo(S)  (13) 

as  an  equivalent  statement  to  (12).  Thus  the  only  way  that  the  ADPCA  scheme  is  optimum 
is  if  it  uses  the  same  steering  vector,  to  within  a  complex  constant,  as  the  optimum  STAP 
scheme.  This  is  true  regardless  of  iZ-1,  assuming  it  has  an  inverse.  This  indicates  that  the 
ADPCA  scheme  can  only  be  optimum  for  the  two  pulse  case  described  above.  In  any  case 
with  more  that  two  pulses,  the  binomial  coefficients  have  magnitudes  which  vary  and  are 
not  constant  as  required  by  (13). 


4-4  Interpretation  of  ADPCA 

We  have  already  shown  that  the  ADPCA  approach  we  study  here  is  not  optimum  for  any 
case  with  more  than  two  pulses.  On  the  other  hand  the  multipulse  canceling  scheme  that  the 
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ADPCA  technique  uses  has  been  used  successfully  in  the  past  in  a  number  of  applications. 
This  technique  operates  on  pairs  of  pulses.  Thus,  one  might  ask  if  the  ADPCA  technique 
described  here  can  be  shown  to  be  equivalent  to  our  multipulse  canceling  scheme  applied 
to  the  optimum  processing  of  pairs  of  pulses,  using  (1)  and  (8).  Here  we  investigate  this 
hypothesis. 


4-4.1  Single  Antenna 


To  simplify  matters,  we  initially  consider  a  three-pulse  case  with  a  single  antenna  as  illus¬ 
trated  in  Figure  2.  We  want  to  know  if 


/ 


1 


\ 

T 

( 

TTi 

2 

r12 

1 

K  r*13 

( 

1 

)7 

\ 

-1 

)  l 

l 

1 

)T( 

V 

-1 

1  \ 

(14) 


V  rh  r33  / 


for  a  given  (rn,  ri2,  r22,  7-23,  ^33)-  In  (14)  we  have  again  used  the  notation  that  the  (Jc— j)th 
entry  of  R  is  denoted  by  r*,j.  Note  that  the  conjugate  symmetry  in  the  matrices  in  (14)  is  a 
requirement  of  any  covariance  matrix  (assuming  an  exact  estimate).  After  simplification  we 
find  that  a  solution  to  (14)  requires 

r22  +  7-21 


B- 


riir22  ~  \r  12 


—  n22r33  —  |r23|2  +  2r2ir33 

— 2r23r3i  +  r2lr*2Z  -  r22r*3 


(15) 


ri2  +  ni 


7-33  +  rt 


23 


T’227'33  —  |7'23| 
|2 


=  -ri2r33 


rur22  -  |ri2|2 
-T'137’23  +  2rnr33  -  2|ri3|2  +  rur2*3  -  r^r^* 


(16) 
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f  7~23  +  r2  2 
T22T33  —  I  ^23 


—  =  ri2r23  —  7'i3T’22  +  2rnr23 

—2ri3r2i  +  rnr22  —  ^*2 


(17) 


where 


5  =  rrir22r33  -  rn  |r23 1 2  -  !ri2|2r33 
+^27'i3^32  +  rl3r12r23  ~  V\3\2r22 


(18) 


Since  (15)  through  (17)  are  a  set  of  nonlinear  equations  it  is  difficult  to  say  how  many 
solutions  exist.  One  can  verify  that  provided  we  choose 

T  13^12  +  T’127’23  ~  r13r22 


n  1  = 


and 


^33  — 


^23 

*12*23  +  r23ri3  -  **22*13 
7-21 


(19) 


(20) 


then  we  obtain  a  solution  to  (15)  through  (17)  for  any  r22,r23)r3i,r32.  While  we  can  not 
say  that  all  solutions  must  be  of  the  form  given  in  (19)  and  (20),  the  set  of  solutions  given 
by  all  possible  choices  of  r22,r23,r31,r32  and  (19)  and  (20)  is  quite  large  and  contains  some 
interesting  cases. 


Figure  2:  Block  diagram  showing  canceling  following  optimum  two-pulse  processing. 
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4-4.2  Ground  Clutter 


Consider  for  example  the  return  from  a  single  discrete  ground  clutter  source  at  an  unam¬ 
biguous  range  and  at  azimuth  angle  (f>c  and  elevation  6C.  The  normalized  Doppler  frequency 
of  the  echo  from  this  source  of  clutter  is 

A  2 vT  cos  6C  sin  (f>c 

c ~  - 

A 

and  we  define 

pu<c  =  exp(j2Tru}c)  (22) 

and 


K  =  (r'^cpl.cplf1 


(23) 


The  clutter  covariance  matrix  for  this  single  patch  of  ground  clutter  is  calculated  as 
Rc  =  tcVcVj  [5]  where  £c  is  related  to  the  expected  value  of  the  square  of  the  amplitude  of 
the  clutter  return  which  can  be  calculated  from  the  system  parameters  for  a  given  clutter 
model  [5].  A  direct  calculation  shows  that  this  covariance  matrix  Rc  satisfies  (19)  and  (20) 
for  any  £c  in  the  limit  as  (j)c  — >  0  (this  is  also  true  as  4>c  — >  d=90°).  This  means  that  the 
conditions  in  (19)  and  (20)  are  approximately  satisfied  for  main  beam  clutter. 

Since  the  conditions  in  (19)  and  (20)  are  satisfied  for  any  £c  as  (j)c  — >  0  (they  are 
invariant  to  a  multiplication  of  R  by  a  scale  factor),  they  are  also  satisfied,  as  4>c  —*  0,  by 
the  autocorrelation  matrix  which  results  from  considering  a  sum  of  any  number  of  clutter 
patches  provided  the  clutter  amplitudes  from  different  patches  are  uncorrelated.  The  uncor¬ 
related  amplitude  assumption  is  frequently  made  in  practice  [5].  Further,  if  additive  noise  is 
considered,  the  resulting  covariance  matrix  still  satisfies  the  conditions  in  (19)  and  (20).  A 
key  result  that  enables  the  above  limiting  statements  to  be  easily  verified  is  that  (19)  and 
(20)  are  satisfied  by  any  covariance  matrix  of  the  form 


R 


c 


1  C  +  N  C  C  N 
C  C  +  N  C 
K  C  C  C+N } 


(24) 
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for  any  C  and  N.  If  the  clutter  arrives  from  an  angle  with  (f)c  close  to  0  then  the  conditions  in 
(19)  and  (20)  are  nearly  satisfied.  In  some  practical  cases  the  conditions  are  approximately 
satisfied  for  all  angles.  This  is  discussed  further  in  the  next  section,  where  we  consider  a 
case  with  two  antennas. 


4-4.3  Antenna  Arrays 

The  above  conclusions  extend  directly  to  cases  with  two  antennas  as  we  now  demonstrate. 
For  the  three  pulse  and  two  antenna  case  we  want  to  show  that 


/  1  ^ 
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(  *11  N 
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'  rn  r12  ...  r*i6  ^ 
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r\2  r22  r26 
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V  *33  / 
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1  *11  ^ 

1 

r*2  r22  r23  t-24 

*21 

-1 

r13  r23  r33  r3A 
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V  r14  r24  r34  r44  ) 

v  X22  y 
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(  \ 

-1 

/  \ 

1 

r  33  7-34  ^35  ^36 

*12 

1 

1”34  ^44  ^45  ^46 

*22 

-1 

r35  r45  r55  r56 

*13 

^  r36  r46  r56  r66  j 

V 2:23  / 

(25) 


Due  to  the  increase  in  the  number  of  unknowns  the  conditions  corresponding  to  (19) 
and  (20)  are  quite  complicated  and  difficult  to  interpret  here.  However  similar  statements 
can  be  made  with  respect  to  ground  clutter  as  for  the  single  antenna  case.  With  pUjC  as 
defined  in  (22)  and 

ve  =  (pIpI ,pIpI,pIpI, pIpI , vlvl ,pIpI)t  (26) 
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the  clutter  covariance  matrix  for  a  single  patch  of  ground  clutter  at  an  unambiguous  range 
and  at  azimuth  angle  <f>c  and  elevation  9C  is  again  calculated  as  Rc  =  (cVcV^ .  Once  again, 
a  careful  calculation  shows  that  this  covariance  matrix  Rc  will  satisfy  (25)  for  any  £c  in 
the  limit  as  (j)c  — *  0  provided  sufficient  noise  is  added  so  that  the  inverses  in  (26)  are  well 
behaved.  The  conditions  which  require  well-behaved  inverses  are  also  present  in  the  single 
antenna  case,  but  these  problems  can  be  avoided  by  using  the  conditions  in  (19)  and  (20) 
which  allow  one  to  approach  limiting  cases  without  numerical  instabilities  even  when  the 
inverses  are  not  well  behaved. 

One  can  also  show  that  in  some  practical  cases  the  conditions  on  Rc  given  in  (25) 
are  approximately  satisfied  for  clutter  with  small  but  nonzero  <f>c.  As  one  example  consider 
a  case  with  noise  power  normalized  to  1  and  clutter  to  noise  power  ratio  of  1  x  108.  We 
obtained  plots  of  error,  in  satisfying  (25).  versus  the  clutter  azimuth  angle  <j>c  and  /?  where 

gives  the  slope  of  the  clutter  ridge  [5].  For  all  90°  <  (f>c  <  90°  and  0  <  (3  <  10  we  found 
an  error  less  than  10~4.  Similar  error  results  were  also  obtained  for  other  clutter-to-noise 
ratios  and  for  cases  with  more  than  two  antennas.  Specifically  we  obtained  errors  less  than 
2.5  x  10~4  for  cases  with  3, 4, 5, 6,  7,  8  and  9  antennas. 


4-5  Performance  Loss  Compared  to  Optimum 

If  the  environment  can  be  exactly  characterized  (R  is  equal  to  the  true  covariance  matrix) 
and  the  exact  position  of  a  target  is  known  then  one  can  easily  characterize  the  loss  in 
performance  that  results  from  using  ADPCA  as  opposed  to  an  optimum  scheme.  In  this 
case  the  loss  is  exclusively  due  to  using  a  non-optimum  steering  vector,  as  shown  in  (13), 
and  this  loss  is  measured  by  the  signal-to-interference-plus-noise  ratio  (SINR)  of  the  ADPCA 
technique  divided  by  the  SINR  of  the  optimum  technique  (assuming  perfect  knowledge  of 
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R )  which  is 

SINRp  =  \Vp(S)H  R~1Vq(S)\2 

SINRo  VDiS^R-'VpWVoiS^R-WoiS)'  {  ] 

smrO  ranges  between  0  and  1.  From  the  Schwartz  inequality,  the  maximum  occurs  when 
(Vd(S)h R-1)1*  =  Vo(S).  The  minimum  occurs  when  (Vp(S)H  R-1)11  is  orthogonal  to  Vo(S). 
In  general  the  performance  is  somewhere  in  between  these  extremes  and  depends  on  the  exact 
value  of  R ~l  and  Vq(S).  Define  the  unit  vectors 


UD(S) 


R-^VpjS) 

l|£-1/2^(S)|| 


where 


U  ls]  =  ^RMS) 

°l  }  ||jj-W0(S)|| 

denotes  the  magnitude  of  a  vector  V.  Using  (29)  and  (30)  in  (28)  yields 


SINRp 

SINRo 


=  \UD(S)Uo(S)\2. 


From  (31)  it  is  clear  that  the  loss  here  is  determined  only  by  the  angular  distance  between 
the  vectors  Up(S)  and  UD(S).  If  R  is  equal  to  the  true  covariance  matrix  then  performance 
is  monotonic  in  SINR  and  (31)  completely  characterizes  the  loss  in  performance.  In  the  more 
general  case  the  performance  is  more  difficult  to  characterize  and  probability  of  detection 
may  be  more  appropriate  measure  of  performance.  In  these  cases  the  loss  in  performance,  as 
compared  to  the  optimum  case  for  the  case  where  R  is  equal  to  the  exact  covariance  matrix,  is 
a  random  variable  whose  distribution  has  been  characterized  for  some  cases  of  interest  [8,  9], 
including  those  where  the  reference  samples  used  to  estimate  R  are  statistically  equivalent 
to  the  actual  clutter-plus-noise  present  in  the  range  cell  under  test.  If  a  mismatch  is  present 
then  the  performance  of  the  ADPCA  scheme  may  actually  exceed  the  performance  of  the  test 
in  (8),  which  is  no  longer  optimum  due  to  the  poor  estimate  of  R.  We  study  this  interesting 
case  in  the  next  section. 
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4-6  Performance  Compared  to  Other  STAP  Schemes 


Consider  a  case  where  the  noise-plus-clutter  from  the  range  cell  under  test  and  the  noise- 
plus-clutter  observations  used  to  estimate  R  are  mismatched  so  that  each  has  a  different 
statistical  description.  For  simplicity  let  us  assume  the  clutter-plus-noise  observations  con¬ 
sist  of  additive  contributions  of  noise  and  clutter  and  that  the  noise  and  clutter  are  statis¬ 
tically  independent.  Assume  that  the  clutter  portion  of  the  reference  samples  is  Gaussian 
distributed  with  the  two  dimensional  power  spectral  density  (psd)  described  in  [1] 


Pc(ftJs) 


E^c.d 

— - - - exp 

d= i  27t ty fttd(T fS)d 


2 a%,d 


^a}a,d 


which  is  a  function  of  Doppler  frequency  ft  and  spatial  frequency  f3.  The  psd  in  (32)  models 
the  case  where  the  ground  clutter  is  assumed  to  be  dominant  over  other  sources  of  clutter. 
This  psd  allows  a  simple  approximation  to  the  clutter  ridge  which  would  be  obtained  from 
the  ground  clutter  received  by  a  moving  platform.  One  advantage  of  this  approximation  is 
that  (32)  can  be  inverted  easily  to  obtain  an  expression  for  the  covariance  matrix  [1],  The 
approximation  in  (32)  consists  of  six  Gaussian-shaped  humps,  the  dth  of  which  has  amplitude 
<7 ld,  is  centered  at  (ft,  fa)  =  (fct,d,  fca,d),  and  has  a  spread  in  angle  and  Doppler  controlled 
by  crjt  d  and  aj3  d.  To  be  specific  consider  a  case  where  the  training  data  is  described  by  the 
psd  in  (32)  with  the  specific  parameters  shown  in  Table  1.  This  psd  is  shown  in  Figure  3. 


Now  assume  that  the  range  cell  under  test  has  a  clutter  contribution  to  its  received 
signal  which  has  the  two-dimensional  psd  shown  in  Figure  4  which  has  the  same  parameters 
as  for  the  psd  shown  in  Figure  3  except  that  it  has  ac%  =  1.155.  The  psds  in  Figure  3  and 
Figure  4  could  model  a  case  where  there  is  a  large  clutter  return  from  a  few  discrete  ground- 
based  scatterers  which  are  not  present  in  the  training  data.  Such  cases  have  been  observed 
in  some  recently  measured  airborne  data  [10].  Alternatively  the  difference  in  the  psds  in 
Figure  3  and  Figure  4  could  be  the  result  of  jamming  [11].  For  both  the  reference  samples 
and  for  the  data  under  test  we  assume  the  noise  contribution  to  the  noise-plus-clutter  is 
also  Gaussian  distributed  and  that  the  observations  at  different  sensors  and  from  different 


d 

&c.d 

aft,d 

ai*A 

fct,d 

fcs,d 

1 

0.5774 

0.0217 

0.0909 

-0.1522 

-0.3182 

2 

0.5774 

0.0217 

0.0909 

-0.0652 

-0.1364 

3 

1.0 

0.0217 

0.0909 

0.0217 

0.0455 

4 

0.5774 

0.0217 

0.0909 

0.1087 

-0.2273 

5 

0.5774 

0.0217 

0.0909 

0.1957 

0.4091 

6 

0.5774 

0.0217 

0.0909 

0.2826 

0.5909 

Table  1:  Parameters  of  assumed  psd  for  training  samples. 

pulses  are  statistically  independent  with  constant  power  of  0.0001  so  that  the  noise  power 
is  significantly  less  than  that  of  the  clutter,  a  case  which  arises  frequently  in  practice.  This 
leads  to  a  covariance  matrix  which  is  diagonal  with  the  constant  power  along  the  diagonal. 
The  overall  noise-plus-clutter  covariance  matrix  is  the  sum  of  the  covariance  matrix  from 
each  acting  alone. 

Consider  a  case  with  two  sensors  and  three  pulses  and  assume  that  a  sufficiently  large 
number  of  reference  samples  are  used  so  that  a  perfect  estimate  of  the  covariance  matrix 
is  made  based  on  the  training  samples.  Consider  the  case  of  a  single  target  at  an  azimuth 
angle  (f>  =  0  in  (7)  and  at  a  normalized  Doppler  frequency  ut  =  0.3  in  (10).  Assume  that  the 
signal  and  noise-plus-clutter  combine  in  an  additive  way  so  that 

V(X)  =  aV(S )  +  V(C)  (33) 

where  V(X)  is  the  observed  data  vector  used  in  (1),  V^S)  is  steering  vector  is  defined  in 
(8),  and  V(C )  represents  the  additive  noise-plus-clutter  vector.  In  (33)  a  is  an  complex 
number  whose  amplitude  is  unknown  (thus  we  don’t  know  if  target  is  present)  and  whose 
phase  is  typically  assumed  to  be  uniform  or  possibly  known  (possibly  from  a  previous  scan). 
We  compared  the  probability  of  detection  performance  of  the  scheme  which  uses  the  test 
statistic  in  (1)  with  the  steering  vector  in  (8),  which  we  call  fully  adaptive  STAP  here,  to 
that  obtained  when  the  ADPCA  scheme  is  used.  For  the  case  of  mismatched  training  data 
outlined  here,  a  Monte  Carlo  simulation  using  100000  runs  produced  the  results  in  Figure  5 
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Figure  3:  Two  dimensional  power  spectral  density  of  the  training  data. 

for  the  case  where  the  phase  of  a  is  known.  Another  Monte  Carlo  simulation  produced 
the  results  in  Figure  6  for  the  case  where  a  has  uniform  phase.  Larger  run  lengths  were 
also  tested,  but  produced  little  change  in  the  results.  The  ADPCA  scheme  outperforms 
the  fully  adaptive  STAP  scheme  for  all  signal  amplitudes.  Similar  results  were  obtained  for 
cases  with  different  values  of  ajtd  and  cryad,  including  small  cr|td  and  0/5)d.  This  indicates 
that  the  ADPCA  scheme  outperforms  the  fully  adaptive  STAP  scheme  in  these  cases,  even 
if  the  mismatch  is  due  to  only  a  single  discrete  ground-based  scatterer.  The  reason  for 
the  improvement  in  performance  in  the  ADPCA  scheme  is  apparently  based  on  its  pulse 
canceling  structure,  which  allows  this  scheme  to  cancel  clutter  with  high  correlation  across 
several  pulses  even  if  this  correlation  is  not  present  in  the  training  data.  Recall  that  for  cases 
where  the  training  data  does  not  exhibit  dependence,  where  R_1  is  an  identity  matrix,  then 
ADPCA  implements  a  pulse  canceling  scheme  exactly. 
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Figure  4:  Two  dimensional  power  spectral  density  of  the  cell  under  test. 

4-7  Conclusion 

An  analysis  of  the  ADPCA  STAP  scheme  has  been  given.  Conditions  under  which  this 
scheme  is  optimum  are  given  for  the  case  of  observations  with  a  known  covariance  matrix. 
We  demonstrate  that  these  conditions  can  occur  if  exactly  two  pulses  are  used  in  the  STAP 
scheme  and  that  the  conditions  can  not  be  satisfied  for  ADPCA  schemes  which  use  more  than 
two  pulses.  However,  there  is  a  nice  interpretation  of  the  ADPCA  scheme  which  may  explain 
its  good  performance  under  some  conditions.  For  cases  with  noise  and  ground  clutter  only, 
the  ADPCA  scheme  sometimes  approximates  a  pulse  canceling  scheme  applied  to  optimally 
processed  pairs  of  pulses.  The  approximation  is  exact  for  cases  where  the  clutter  arrives  with 
zero  azimuth  angle,  but  it  is  very  nearly  true  for  cases  with  three  pulses  for  ground  clutter 
arriving  at  any  angle.  Finally,  we  analyzed  the  detection  performance  for  a  case  where  the 
training  data  does  not  characterize  all  of  the  clutter  and  we  showed  cases  where  the  ADPCA 
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Figure  5:  Probability  of  detection  of  ADPCA  as  compared  to  fully  adaptive  STAP  for  a 
case  with  additional  interference  which  was  not  present  in  the  training  data,  a  false  alarm 
probability  of  0.1  and  known  target  phase. 

scheme  outperforms  the  fully  adaptive  STAP  scheme,  which  is  optimum  for  the  case  where 
R  is  equal  to  the  true  covariance  matrix. 
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Figure  6:  Probability  of  detection  of  ADPCA  as  compared  to  fully  adaptive  STAP  for  a 
case  with  additional  interference  which  was  not  present  in  the  training  data,  a  false  alarm 
probability  of  0.1  and  uniformly  distributed  target  phase. 
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This  report  considers  the  problem  of  partitioning  radar  signal  data  according  to  their  covariance 
structures.  A  partitioning  procedure  is  developed  from  the  classical  ranking  and  selection  approach  to 
include  in  the  selected  subset  those  secondary  populations  which  have  the  same  covariance  matrix  as  the 
primary  population  and  exclude  those  populations  which  have  significantly  different  covariance  matrices 
from  the  primary  population.  The  formulas  for  the  performance  measure,  the  probability  of  a  correct 
partition,  is  derived  and  the  least  favorable  configuration  is  found  so  that  the  sample  size  needed  to 
guarantee  certain  probability  requirement  can  be  computed.  Guideline  for  further  research  is  given  at  the 
end  of  this  report. 
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PARTITIONING  PROCEDURE  IN  RADAR  SIGNAL  PROCESSING  PROBLEMS 
Pinyuen  Chen  William  Melvin  Michael  Wicks 


1.  INTRODUCTION 

The  problems  of  statistical  inference  are  basically  treated  as  those  of  estimation  (point  and 
interval)  or  testing  of  hypotheses.  This  formulation  does  not  suit  the  objectives  of  an  experimenter  in 
situations  when  one  needs  to  compare  among  several  alternatives.  These  may  be,  for  example,  different 
varieties  of  a  grain,  different  competing  manufacturing  processes  for  an  industrial  product,  different 
drugs  (treatments)  for  a  specific  disease,  or  clutters  with  different  covariance  structures  from  the  signal 
processing  data.  Some  other  examples  are  comparison  of  common  stocks  for  investment  and  traffic 
fatality  rates  of  several  states.  In  all  these  problems,  we  have  k  ( >  1)  populations  and  each  population  is 
characterized  by  the  value  of  a  parameter  of  interest  8,  which  may  be,  in  the  example  of  signal 
processing,  an  appropriate  measure  of  the  covariance  matrix  of  a  clutter.  In  the  example  of  comparing 
common  stock,  an  appropriate  measure  may  be  the  average  ratio  of  price  to  earnings. 

The  classical  approach  in  statistics  to  the  preceding  examples  is  to  test  the  null  hypothesis  of 
homogeneity  H<,:  01=02=...  =  0i  where  0  j ,  8  2 , , ,  0  *  are  the  values  of  the  parameter  0  for  these 
populations.  Tests  based  on  Neyman-Pearson  theory  have  been  developed  for  various  such  situations 
and  these  are  available  in  the  statistics  literature. 

If  a  test  of  homogeneity  is  the  primary  and  final  goal  of  an  investigator,  alternative  methods  of 
statistical  analysis  are  not  needed.  However,  there  are  many  practical  situations  in  which  other  kinds  of 
information  or  other  goals  are  of  interest.  The  experimenter  may  be  interested  in  (1)  determining  which 
populations  differ  from  which  others,  and  in  what  direction,  or  (2)  to  see  which  populations  can  be 
considered  best  in  some  well-defined  sense  of  “best”.  For  the  example  of  signal  processing  problem,  we 
might  be  interested  in  selecting  those  secondary  data  (or  the  reference  data)  which  are  more  comparable 
to  the  primary  data  (or  the  test  data)  in  their  covariance  structures.  After  the  selection  process,  we  shall 
use  the  selected  data  to  test  the  signal  by  using  the  conventional  testing  procedures  which  were  designed 
under  the  ideal  situation  while  the  secondary  data  and  the  primary  data  have  the  same  covariance 
structure.  The  ranking  and  selection  procedures  have  been  designed  specifically  to  resolve  such 
problems. 

The  formulation  of  ranking  and  selection  procedures  has  been  generally  accomplished  under 
one  of  two  basic  approaches,  namely,  the  indifference  zone  approach  (IZ)  and  the  subset  selection 
approach  (SS).  Most  of  the  literature  in  ranking  and  selection  theory  comes  under  one  of  these  two 
approaches  and  modifications.  The  main  concern  of  this  report,  namely,  screening  procedures  for  the 
secondary  data  which  have  different  covariance  structures  from  the  primary  data,  falls  into  the  category 
of  a  modification  of  the  subset  selection  formulation:  partitioning  populations  (secondary  data)  with 
respect  to  a  control  (test  cell  data).  Here  we  shall  introduce  the  basic  concepts  of  the  IZ  and  the  SS 
approaches.  Partitioning  approach  which  is  more  pertinent  to  radar  signal  processing  data  will  be 
defined  formally  in  Section  2. 

Consider  k  independent  populations  7tj,7t2,...,rtjt  where  7t has  the  underlying  distribution 
F0j ,  i  -  1,  2,  k.  The  0,-  are  unknown  real- valued  parameters  which  represent  the  value  of  a  quality 
characteristic  of  interest  for  these  k  populations.  We  define  n to  be  better  than  n  j  if  0  ,•  >  Qj .  The 
ordered  0,  are  denoted  by  0^]  <...<0^] .  For  the  basic  problem  of  selecting  the  best  population,  the 

indifference  zone  formulation  of  Bechhofer  (1954)  defines  that  the  selection  of  the  population 
associated  with  0^]  results  in  a  correct  selection  (CS).  He  required  that  for  any  procedure  R  to  be 

valid,  R  should  guarantee  a  specified  minimum  probability  of  a  correct  selection  P(CS),  say  P*  (1/k  < 
P*  <  1)  whenever  the  best  and  the  second  best  populations  are  apart  at  least  by  a  specified  amount.  Let 
8(8;  ,0  j)  denote  an  appropriately  defined  non-negative  measure  of  the  amount  of  separation  between 

the  population  associated  with  0  ,•  and  Qj .  For  any  specified  8*  >  0,  let  ft  5*  be  the  subset  of  the 
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parameter  space  £2  =  {010  =  (0i,...,0^)}  defined  by  £25*  =  {0I8(0[^,0^])  >8*} .  The  subset  Q5*  is 
called  the  preference  zone.  Letting  P(CS1R)  denote  the  PCS  of  a  rule  R,  in  order  to  be  valid,  it  should 
satisfy  P(CS1R)  >  P*  for  all  0  e£2g*.  Both  8*  and  P*  are  specified  by  the  experimenter  in  advance. 
Suppose  R  is  based  on  samples  of  fixed  size  n  from  each  population.  The  problem  then  is  to  determine 
the  smallest  sample  size  n  for  which  the  P*  requirement  holds.  The  compliment  of  the  preference  zone 
£2 5*  is  called  the  indifference  zone  where  no  requirement  is  made  on  the  PCS. 

In  the  subset  selection  approach  of  Gupta  (1956),  the  basic  problem  is  to  select  a  non-empty 
subset  of  the  k  given  populations  so  that  the  best  population  is  included  in  the  selected  subset  with  a 
guaranteed  minimum  probability  P*.  In  case  of  a  tie  for  the  best  place,  we  assume  that  one  of  the 
contenders  is  tagged  as  the  best.  Selection  of  any  subset  that  includes  the  best  results  in  a  correct 
selection.  Any  valid  rule  R  should  satisfy  P(CSIR)  >  P*  for  all  0  e  £2  .  Note  that  the  size  S  of  the 
selected  subset  is  not  decided  in  advance  but  is  determined  by  the  data. 

The  procedures  should  satisfy  the  probability  requirement  P*.  Any  parameter  configuration  0 
which  yields  the  infimum  of  the  P(CS)  over  the  preference  zone  £25*  or  the  entire  parameter  space  £2, 
depending  on  the  approach  (IZ  or  SS),  is  called  the  least  favorable  configuration  (LFC).  Many 
variations  and  generalizations  of  the  basic  formulation  using  either  of  the  two  approaches  have  been 
studied.  One  important  related  selection  problem  is  selecting  populations  better  than  a  standard  or  a 
control  it  q  .  In  our  study  in  selection  procedure  based  on  radar  signal  processing  data,  the  standard  or 
control  can  be  taken  as  the  primary  data.  The  re-grouped  secondary  data  are  those  independent 
populations  it  1 , Jt  2 ,  ■  •  •  ,  it  k  .  We  will  be  interested  in  selecting  those  populations  from  those  secondary 
data it1,it2,...,itt  that  have  the  same  covariance  structure  as  the  control  population  from  the  primary 
data. 


Radar  signal  processing  data  usually  comes  in  the  forms  of  random  vectors  and  they  were 
analyzed  statistically  in  the  framework  of  multivariate  analysis.  In  Section  2,  we  will  briefly  review  the 
selection  theory  in  multivariate  analysis.  In  Section  3,  we  try  to  relate  the  radar  signal  processing 
problem  and  the  testing  procedure  studied  in  Chen  (1994)  to  the  statistical  selection  concepts  introduced 
in  Sections  1  and  2.  In  Section  4,  we  formally  define  the  partitioning  formulation  of  selection  theory  that 
will  be  used  in  our  applications  in  radar  data.  In  Section  4,  we  will  also  propose  a  procedure  to  solve  the 
partitioning  problems  for  radar  signal  processing  data.  We  will  derive  P(CP),  the  probability  of  a  correct 
partition,  for  the  proposed  procedure  in  Section  5.  Final  remarks  for  future  investigations  will  be  given 
in  Section  6. 


2.  MULTIVARIATE  SELECTION  PROBLEMS 

A  multivariate  distribution  is  the  joint  probability  distribution  of  p  variables  or  components  where 
p  >  1.  A  single  observation  from  a  multivariate  population  is  a  measurement  on  each  of  these 
components,  that  is,  a  pair  of  measurements  if  p=2,  a  triplet  of  measurements  if  p=3,  and  in  general,  a  p- 
tuple  of  measurements.  Therefore  a  random  sample  of  size  n  from  a  multivariate  distribution  consists  of 
a  number  n  of  p-tuples  of  measurements,  one  p-tuple  for  each  of  the  n  sample  observations.  We 
sometimes  refer  to  these  as  vector  observations.  The  main  multivariate  distribution  considered  in  this 
report  is  the  multivariate  normal  distribution  (complex-  or  real-  valued)  which  has  as  parameters  not 
only  the  means  and  variances  of  each  of  the  p  variables,  but  also  covariances  between  pairs  of  these 
components.  A  multivariate  normal  distribution  with  mean  vector  p  and  covariance  matrix  2  is 
denoted  by 

Let  k  be  k  p-variate  normal  populations,  Np (p.,- , Zf ) ,  i  =  1,  2,  k.  We  consider 

several  measures  0  for  selecting  the  populations  such  as  the  generalized  variance  and  the  Mahalanobis 
distance.  These  measures  0  are  unknown  real- valued  parameters  which  represent  the  value  of  a  quality 
characteristic  of  interest  for  these  k  populations.  In  the  real  world  applications  of  statistical  ranking  and 
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selection,  the  experimenter  usually  first  decide  which  measure  is  to  be  used  to  rank  the  populations. 
Then  he/she  will  study  selection  procedures  based  on  a  good  estimate  of  0  .  The  review  of  multivariate 
selection  problems  here  is  to  introduce  some  familiar  measures  and  the  procedures  that  statisticians  use 
in  general  practice.  Those  measures  may  not  be  applicable  in  our  study  in  radar  signal  processing  which, 
as  in  any  field  of  specialty,  has  its  own  unique  properties.  However,  as  it  will  become  clear  in  Section  3, 
the  general  format  of  our  selection  goal  and  probability  requirement  in  radar  signal  processing  problems 
follows  closely  to  those  of  existing  procedures  that  are  reviewed  here. 

2.1  Selection  in  terms  of  Mahalanobis  Distance 

Let  A,-  =  ,  the  Mahalanobis  distance  from  the  origin.  We  consider  the  selection  of  the 

population  associated  with  the  largest  A,- . 

2.1.1  Subset  Selection  Approach 

Let  =  denote  n  independent  observations  from  =  Define 

(2.1)  Yij  =  ,  Yt  =  1 1 \j,Zt  =  XiX~lXi , 

where  X(  and 

S{  are  the  sample  mean  vector  and  covariance  matrices. 

Case  1:  When  the  covariance  matrices  E;-  are  known: 

In  this  case,  Gupta  (1966)  proposed  the  rule 

(2.2)  Ra  :  Select  n,  if  and  only  if  L  >  c  Ylk] 

where  0  <  c  =  c(k,p,n,  P*)  <  1  is  the  largest  number  for  which  the  P*-condition  is  satisfied.  It  has  been 
shown  that  the  LFC  is  A  =  (Aq,...,^)  =  (0,...0) .  Thus  the  constant  c  in  the  procedure  can  be 
found  to  be  the  solution  of  the  following  integral  equation: 

(2.3)  J^°  Gy -1  (*  /  c)dGv  (*)  =  P* 

where  G/x)  is  the  distribution  function  of  a  Gamma  random  variable  with  probability  density  function 

(2.4)  g(x)  =  *>0. 

The  values  of  c  to  implement  the  procedure  can  be  found  in  Gupta  (1963).  For  the  analogous  procedure 
for  selecting  the  population  associated  with  the  smallest  A,- ,  the  procedure  Rc  defined  in  (2.2)  can  be 
modified  to  “select  7t /  if  and  only  if  T,  <d  Y[„“.  The  appropriate  constant  d  can  be  found  in  Gupta  and 
Sobel  (1962). 

It  is  natural  to  use  the  statistic  Z;  instead  of  the  Y;  in  the  rule  Rc.  .  In  this  situation  however,  the 
infimum  of  the  P(CS)  and  hence  the  procedure  parameter  c  do  not  dependent  on  n,  the  sample  size  of 
the  experiment;  this  is  an  unsatisfactory  feature  for  a  statistical  inference  procedure.  One  can.  of  course, 
study  a  different  type  of  procedure,  for  example,  “select  7t /  if  and  only  if  Y{  <  Y[kr-d"  d  >  0.  Such  a 
procedure  has  not  been  investigated  in  the  literature. 

Case  2:  When  the  covariance  matrices  E;  are  not  known: 

In  this  case  Gupta  and  Studden  (1970)  proposed  the  rule 
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(2.5)  Rgs  :  Select  71/  if  and  only  if  T-,  -  e  T[kl 

where  0  <  e  =  e(k,p,n, P*)  <  1  is  to  be  chosen  suitably  to  satisfy  the  P*-condition.  As  in  the  previous 
Case  1,  theLFC  is  )  =  (0, ...0)  and  the  constant  e  is  given  by 

<2-6>  Jo  Fp,n-pU/e)dFp_„-p(x)  =P* 


where  Fp  n.p(x)  is  the  distribution  function  of  a  F  random  variable  with  probability  density  function 

1 


(2.7) 


/(*)  = 


B(~P,~(n-p)) 


x>0. 


Values  of  e  to  implement  the  procedure  have  been  tabulated  for  selected  values  of  k,  n,  p,  and  P*  by 
Gupta  and  Panchapakesan  (1969),  who  also  tabulated  the  needed  constant  h  for  the  analogous  rule 
“Select  7t/  if  and  only  if  7)  <hTm“  for  selecting  the  population  associated  with  the  smallest  A,f . 


2.1.2:  Indifference  Zone  Approach 

Alam  and  Rizvi  (1966)  have  investigated  procedures  for  selecting  the  populations  associated  with 
the  t  largest  Xi ,  1  <  t  <  k-1,  by  taking  for  the  preference  zone  Q(S  \  *,8  2*)  =  j  *  fT  ^8 2  * 

where 


(2.8) 

^Sj*  =  f+ij  —  —Si*}  and  ^s2*  =  $:^[jfc-7+l]  ^[k-7] -$2*} 


for  specified  8/*  >  0  and  82*  >  1.  They  considered  the  natural  selection  rule,  namely, 

(2.9)  Rar:  Select  the  populations  that  yielded  the  t  largest  Z,  in  the  case  of  known  X/  ‘s 

and  the  t  largest  7)  in  the  case  of  unknown  X/  ‘s. 

The  LFC  is  given  by 

(2.10)  Xra  =...X[t_tJ  =8,  *(«2  *-l)_,,X[t.,+11  =...x[t]  =  8,  *82  *(82  '-l)"1. 

The  smallest  value  of  n  required  to  satisfy  the  P*  condition  can  be  obtained  in  the  case  of  known  X/  ‘s 
from 


(2.11) 


nb\  *82  * 

80  *-l 
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)}  gp(x> 


nd\ *82  * 
8?  *-l 


)dx  =  P* 


where  gp(x,Q)  and  Gp(x,Q)  denote  the  probability  density  function  and  the  distribution  function, 

respectively,  of  a  non-central  chi-square  random  variable  with  p  degrees  of  freedom  and  noncentrality 
parameter  8.  The  smallest  value  of  n  required  to  satisfy  the  P*  condition  can  be  obtained  in  the  case  of 
unknown  X/  ‘s  from 


(2.12)  t  £  Fp,n-p(x,^—){l- Fp  (x. 


”81*  U1-F  ll 

80  *-l  1  p(  ’  89  *-l 


7-1 

)}  / p,n—i 


(x. 


»5l*82 

82*-! 


* 

-)dx  =  P* 
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where  fpn_  p(x,Q)  and  Fpn_p(x,Q)  denote  the  probability  density  function  and  the  distribution 

function,  respectively,  of  a  non-central  F  random  variable  with  p  and  n-p  degrees  of  freedom  and 
noncentrality  parameter  0. 

2.2  Selection  in  terms  of  the  Generalized  Variance 

The  generalized  variance  0  =1X1  of  a  multivariate  normal  population  with  covariance  matrix  2 
serves  as  an  over-all  measure  of  the  variability  among  the  components.  Gnanadesikan  and  Gupta  (1970) 
considered  selection  of  the  population  associated  with  the  smallest  0  j .  They  proposed  a  subset 
selection  rule 

(2.13)  Rgg  :  Select  rc,  if  and  only  if  Wj  <  l  W[t] 

where  W;  =1  St  I  is  the  sample  generalized  variance  and  0  <  1  =  l(k,p,n,P*)  <  1  is  to  be  chosen  to  meet 
the  P*  condition.  They  have  shown  that 

(2.14)  inf  P(CS\Rgg)  =  P(Y\  <  y  Yj ,  j  =  2 

where  the  Yj  are  i.i.d.,  each  distributed  as  a  product  of  p  independent  factors  where  the  rth  factor  has  a 
Chi-square  distribution  with  (n-r)  degrees  of  freedom.  An  exact  solution  for  l  is  obtained  for  p=2  and  is 
tabulated  by  Gupta  and  Sobel  (1962).  For  p  >  2,  Gnanadesikan  and  Gupta  (1970)  have  studied 
approximations  using  the  normal  approximation  of  log  x2  and  Hoel’s  approximation  of  the  distribution 

of  P  by  an  appropriate  gamma  distribution. 

Eaton  (1967)  considered  a  decision-theoretic  approach  to  ranking  the  k  populations  according  to 
the  values  of  the  0  ,• ,  assuming  reasonable  properties  for  the  loss  function  which  dependents  only  on  the 
0  ,• .  He  showed  that  the  natural  rule  which  ranks  the  populations  according  to  the  values  of  Wi  is 
minimax,  admissible,  and  uniformly  the  best  among  rules  that  are  invariant  under  permutations  of 

As  we  can  see  from  the  rules  described  in  (2.2),  (2.5),  and  (2.12)  above,  the  selection  procedures 
under  the  SS  approach  (i.e.,  when  the  goal  is  to  select  a  subset  containing  the  best  or  to  eliminate  bad 
populations)  are  all  of  the  ratio  type  whose  decision  rule  depends  on  the  ratio  of  the  statistics  of  two 
samples.  Another  type  used  as  commonly  as  the  ratio  type  in  univariate  selection  theory  is  the  difference 
type  whose  decision  rule  depends  on  the  difference  of  the  statistics  of  two  samples.  The  difference  type 
procedures  have  not  been  frilly  explored  in  multivariate  selection  theory.  The  procedure  described  in 
(2.9)  is  neither  ratio  nor  difference  type  since  under  the  indifference  zone  approach  one  is  interested  in 
selecting  only  the  best  population,  not  a  subset  containing  the  best. 

In  the  multivariate  selection  problems  we  have  discussed  so  far,  the  multivariate  normal 
populations  have  been  ranked  according  to  the  values  of  a  scalar  function  of  the  parameters  which  are 
usually  in  vector  or  matrix  form.  In  the  indifference  zone  approach,  a  measure  for  the  scalar  function  of 
the  parameters  would  be  used  to  define  the  indifference  zone.  Then  a  procedure  based  on  the  estimators, 
the  so-called  selection  statistic,  of  the  scalar  function  of  the  parameters  would  be  proposed  for  the 
selection  goal.  In  the  subset  selection  approach,  we  usually  do  not  define  the  indifference  zone.  Thus  the 
measure  for  the  parameters  is  not  needed.  However  the  selection  procedure,  again  defined  by  the 
estimator  of  the  parameters  of  interest,  would  depend  on  some  measure  of  the  parameters.  For  example 

in  Section  2.1,  ,  the  Mahalanobis  distance  from  the  origin,  is  the  scalar  function  of  the 

parameters  of  interest.  The  measure  used  in  the  subset  selection  procedure  “Rc  :  Select  n /  if  and  only  if 
Yj>c  Y[kI  ”  is  ratio.  Notice  that  the  procedure  also  depends  on  the  estimator  y  of  the  parameters.  The 
statistic  y  is  the  selection  statistic  here.  In  the  indifference  zone  approach,  both  the  “difference”  and  the 
“ratio”, 
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^5]*  -  {^:^[jfe-f+i]  and  &82*  =  $:^[ifc-r+l]  !\k-t] -82*} 

are  used  to  define  the  indifference  zone.  Again,  the  selection  procedure  depends  on  the  selection 
statistic.  In  order  to  implement  a  selection  procedure,  we  need  to  derive  the  PCS  and  the  LFC.  That  is 
where  the  distribution  theory  of  the  selection  statistic  comes  into  the  picture. 

In  a  recent  paper,  Bofinger  (1992)  has  considered  the  problem  of  multiple  comparisons  with  the  best 
for  multivariate  normal  populations  using  a  “multivariate”  approach.  Bofinger’s  results  are  mainly  for 
bivariate  normal  populations.  She  finds  that  “  for  comparisons  with  the  “best”  of  each  variate,  repeated 
univariate  comparisons  appear  to  be  almost  as  efficient  as  multivariate  comparison,  at  least  for  the 
bivariate  case  and,  under  certain  circumstances,  for  higher  dimensional  cases”.  These  aspects  of  the 
selection  and  related  inference  problems  are  worth  exploring  further. 


3.  APPLICATIONS  OF  SELECTION  THEORY  TO  RADAR  SIGNAL  PROCESSING 

Statistical  inferences  in  radar  signal  processing  problem  usually  involve  two  of  the  major  areas  in 
statistical  research.  These  two  areas  are  (1)  testing  of  hypotheses  and  (2)  multivariate  analysis.  The  null 
hypothesis  of  the  mean  vector  equals  to  0  (i.  e.,  Hq'.\L  =  0 )  implies  that  these  is  no  target  in  the  test 

cell.  The  alternative  hypothesis  of  the  mean  vector  equals  to  a  given  non-zero  vector  (i.e.,  Hq\  p  =  ^ ) 

implies  the  present  of  a  target  in  the  test  cell.  Since  all  the  data  obtained  from  radar  systems  are  assumed 
to  be  correlated  multi-components  vectors  (usually  complex-valued  vectors),  the  testing  procedures 
usually  fall  into  the  framework  of  multivariate  analysis.  Although  some  attentions  in  radar  problems 
have  been  given  to  non-normal  theory  recently,  we  will  focus  our  study  on  normal  (Gaussian)  theory. 

Classical  radar  detection  theory  was  developed  under  the  assumption  that  target  returns  follow  a 
homogeneous  Gaussian  noise.  The  term  “homogeneity”  here  means  the  covariance  matrices  of  the 
reference  cells  (the  secondary  data)  all  have  the  same  structure  as  the  covariance  matrix  of  the  test  cell 
(the  primary  data).  Under  this  assumptions,  the  likelihood  ratio  tests  and  their  modifications  provide 
optimum  signal  processors  whose  performances  have  been  well  studied  and  documented  in  literature 
(see  for  examples,  Kelly  (1986),  Wang  and  Cai  (1990),  Cai  and  Wang  (1990)).  As  it  was  well  reported 
in  Wicks  (1993),  the  detection  of  a  target  in  a  heterogeneous  (non-homogenous)  clutter  and  noise 
interference  environment  in  modern  radar  is  an  important  and  challenging  signal  processing  problem. 
The  classical  detection  algorithms,  designed  for  homogeneous  situation,  may  suffer  a  significant  loss  in 
their  performances  if  the  true  environment  is  heterogeneous.  Statistically,  heterogeneous  problems  can 
be  handled  in  two  manners  :  (1)  One  can  develop  algorithms  for  the  heterogeneous  case  and  compare 
their  performances  with  those  of  the  algorithms  developed  for  the  homogenous  case;  (2)  One  can  make 
inferences  about  the  covariance  structures  of  the  reference  data.  If  the  covariance  structure  of  the 
reference  cells  is  tested  or  estimated  to  have  the  same  structure  as  that  of  the  test  cell,  we  can  apply  the 
existing  algorithms  which  were  designed  under  homogenous  environment  to  the  reference  data. 

Raghavan,  Qiu,  and  McLaughlin  (1995)  made  an  attempt  to  the  approach  (1).  They  considered 
the  detection  problem  assuming  the  clutter  has  unknown  correlation  properties.  However,  the 
distribution  theory  needed  for  calculating  the  performance  measures  (  probability  of  false  alarm  and 
probability  of  detection)  of  their  proposed  statistic  is  far  from  complete.  Therefore,  their  probability  in 
the  null  hypothesis  (i.e.,  the  probability  of  false  alarm)  could  be  computed  only  when  the  signal  is  a 
deterministic  value  which  causes  no  change  on  the  second  order  statistic  (the  covariance)  structure. 
Their  probability  in  the  alternative  hypothesis  (i.e.,  the  probability  of  detection)  was  not  computed  for 

the  test  statistic  z  S  z  proposed  in  the  paper.  In  stead  of  using  the  sample  covariance  matrix  S  in 
the  above  test  statistic,  they  used  an  approximated  covariance  matrix  Rc  which  is  available  only  under 
large  sample  assumption.  Strictly  speaking,  a  complete  solution  to  the  radar  problem  parallel  to  Kelly 

(1986)’s  result  should  consists  of  a  distribution  theory  for  the  test  statistic  Z  S  Z  while  z  is  a 
normal  random  vector  with  zero  mean  (under  the  null  hypothesis)  and  non-zero  mean  (under  the 
alternative  hypotheses)  and  S  is  a  Wishart  random  matrix,  distributed  independently  of  z,  with  a 
covariance  matrix  Rc  in  a  different  form  from  Rs ,  the  covariance  matrix  of  z. 
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Then  one  can  make  comparison  between  their  procedure  with  Kelly’s  GLR  procedure  as  they  tried  to  do 
in  the  paper. 

Chen  (1994)  made  an  attempt  to  the  second  approach  (2).  In  the  report,  the  author  studied  a  test 

H  —1 

statistic  which  took  exactly  the  same  form  as  the  test  statistic  z  S  Z  studied  in  Raghavan,  Qiu,  and 
McLaughlin  (1995).  However,  the  goal  of  Chen  (1994)  was  to  test  if  the  covariance  structure  is  the  same 
for  the  primary  data  and  for  the  secondary  data  while  the  goal  of  Raghavan,  Qiu,  and  McLaughlin 
(1995)  was  to  test  if  a  signal  is  present  in  the  primary  data.  Chen  (1994)  derived  the  distribution  of  the 

test  statistic  zH  S~lz  while  z  is  a  normal  random  vector  with  zero  mean  (under  both  the  null  hypothesis 
and  the  alternative  hypotheses)  and  S  is  a  Wishart  random  matrix,  distributed  independently  of  z,  with  a 
covariance  matrix  Rc  in  a  different  form  from  Rs ,  the  covariance  matrix  of  z.  Based  on  the  findings  in 
Chen  (1994),  the  research  in  this  report  is  a  ranking  and  selection  aspect  of  the  second  approach  (2). 

As  mentioned  in  Section  1,  if  a  test  of  homogeneity  is  the  final  goal  of  an  investigator,  alternative 
analysis  is  not  needed.  However  in  our  radar  detection  applications,  we  are  not  only  interested  in  which 
secondary  data  have  the  same  covariance  structure  as  that  of  the  test  cell.  We  are  also  interested  in 
selecting  those  clutters  whose  covariance  structure  is  the  same  as  that  of  the  test  cell  so  that  a  follow-up 
testing  on  the  signal  by  traditional  algorithms  can  be  applied.  The  ranking  and  selection  theory  has  been 
designed  specifically  to  solve  such  problem. 

The  indifference  zone  approach  of  ranking  and  selection  theory  does  not  meet  our  need  in  radar 
signal  detection  problem  because  it  selects  only  the  best  population.  The  usual  subset  selection  approach 
is  more  appropriate  since  it  eliminates  the  bad  populations.  In  our  application  of  radar  detection,  we 
need  a  procedure  which  can  eliminate  those  secondary  data  which  have  different  covariance  structures 
(in  the  language  of  ranking  and  selection  the  “bad  populations”)  from  the  primary  data.  Suppose  that  the 
secondary  data  comes  from  the  independent  random  vectors  Y\ ,  Y^ , .  • . ,  Yg  and  the  primary  data  comes 

from  the  random  vector  Yq  .  Assume  that  we  can  divide  the  vectors  Y\ ,  Yi , . .  • ,  Y%  into  k  subgroups  of 
vectors  {7^  ,...,Ybl  },{Yai  ,...,Yb2  }»•••>%*  >•••>%*  }  where  each  subgroup  serves  as  a 

population.  This  structure  can  also  be  seen  in  a  multiband  data  discussed  in  Cai  and  Wang  (1990)  where 
they  referred  k  as  the  number  of  subbands.  Thus  our  goal  is,  with  high  probability  of  confidence,  to 
eliminate  those  subgroups  whose  covariance  structures  are  significantly  different  from  the  primary  data 
(or  the  control  population  in  the  language  of  ranking  and  selection  theory).  To  achieve  the  goal,  we  need 
to  modify  the  classical  subset  selection  approach  so  that  the  idea  of  comparing  with  a  control  is  included 
in  the  formulation.  The  new  formulation,  which  will  be  called  partitioning  with  respect  to  a  control,  will 
be  formally  defined  in  the  next  section.  We  will  also  propose  a  procedure,  based  on  the  test  statistic 
H  —1 

Z  S  z  studied  in  Chen(1994),  to  select  the  good  populations  and  to  eliminate  the  bad  populations. 

4.  PARTITIONING  WITH  RESPECT  TO  A  CONTROL 
4.1  General  Formulation: 

This  sub-section  is  concerned  with  a  general  problem  of  partitioning  a  set  of  independent 
populations  into  two  subsets  according  to  their  parameters  with  respect  to  a  control  population.  The 
distribution  model  for  this  type  of  experiment  is  k  populations  /  (x;9  j ),  /  (x;0  2  (*i0  jfc  )»  and 

a  single  control  population  /(x',Qq)  .  Each  of  these  k+1  distributions  is  of  the  same  form;  they  differ 
only  by  the  parameters  which  measures  the  goodness  of  the  respective  populations.  The  values 
of01,02,...,S  £  are  all  unknown  and  the  value  of  0  q  may  be  either  known  or  unknown.  For  arbitrary 

but  fixed  constants 8 1 ,82 .  we  define  two  disjoint  and  exhaustive  subsets  Q&,  and  £2B  of  the  set 
Q=  {712,71 2 by 
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(4.1) 

QG  =  {7^:82  <rf(e,-,e0)<6t},Q5  =  {7tI:rf(ei,0o)<82}u{7CI:flf(0I-,eo)>8]!} 

where  d(x,y)  is  a  distance  measure  of  the  parameters.  The  most  commonly  used  measures  d  are  the 
difference  and  the  ratio.  After  observations  have  been  taken,  the  set  £2  =  {7T  j ,7T 2 ^  }  is 
partitioned  into  two  disjoint  subsets  SG  and  SB  (the  subscripts  G  stands  for  “good”,  B  stands  for  “bad”). 
The  definition  of  a  correct  partition  is  defined  as  follows: 

(4.2)  A  partition  is  correct  (CP)  if  £2G  CZ  Sq  . 

Let  P*  be  an  arbitrary  but  preassigned  constant.  The  statistical  problem  is  to  find  a  procedure  R  which 
consists  of  a  sampling  procedure  and  a  terminal  decision  rule  such  that  the  appropriate  probability 
requirement  stated  below  is  satisfied. 

(4.3)  P(CPieo,01,e2,...,0)fe)>P*. 

4.2  Multivariate  Partitioning  According  to  the  Covariance  Structure: 

Now  we  turn  our  direction  to  the  multivariate  partitioning  problems  that  we  will  use  in  radar 
detection  problems.  LetJt  j  ,7C  2 .... , JE  fc  be  k  p-variate  complex  normal  populations  CN p  (|!j- ,  2; ) ,  i 

=  1,  2,  ....  k,  and  letTE q be  the  control,  a  p-variate  complex  normal  population  G/Vp(p,0,2o)  •  Thus 

the  /  function  in  Section  4.1  is  now  taken  to  be  the  complex  multivariate  normal  density  and  the 
parameter  0  is  taken  to  be  Ql,2)  .  We  further  assume  that  (1  =  0  since  our  primary  concern  is  on  the 
covariance  structure  of  the  data.  A  meaningful  distance  measure  between  two  variances  is  the  ratio 
function  d(x,y)=x/y  since  the  variance  is  a  scale  parameter  in  normal  distribution.  The  covariance  matrix 
of  a  multivariate  normal  distribution  plays  a  similar  rule  as  the  variance  of  an  univariate  normal 
distribution  in  many  applications,  especially  in  distribution  theory.  Thus  we  will  also  use  the  ratio  of  two 
covariance  matrices  to  define  the  distance  measure  in  our  study.  We  define  the  distance  function 

(4.4)  d ( Z,- ,  Zq)  =  the  largest  eigenvalue  of  Z/Zq  *  • 

Now  we  define  the  two  disjoint  and  exhaustive  subsets  and  QB  of  the  set 

£2  =  {7E  j  ,71 2  >  •  •  •  >7E  fc  }  by  using  the  distance  function  d.  They  are 

QG={7C/l8t<rf(Zf,Z0)<82} 

(4.5)  and 

£2  A  —  £2  —  £2G  . 

where  8  j  <82  are  pre-assigned  positive  real  numbers  used  to  differentiate  the  good  and  the  bad 

l|t 

populations.  The  values  of  8  j  and  5  2  should  be  near  1  since  for  the  special  case  when  dimension  p=l 
(i.e.,  the  univariate  case),  the  distance  measure  d  has  the  same  meaning  as  the  ratio  of  two  variances.  A 
population  is  good  if  the  ratio  of  its  variance  and  the  variance  of  the  control  population  is  near  1.  This 
also  explain  why  the  function  d  is  chosen  to  define  the  “good”  and  the  “bad”  populations.  Our  goal  is  to 
partition  the  populations  obtained  from  the  reference  data  into  two  disjoint  subsets  SG  and  SB  .  The 
partition  is  correct  (CP)  if  Sq  CZ  £2G  .  It  means  that  all  the  population  included  in  selected  subset  SG 
are  good,  or  they  all  have  similar  covariance  structure  as  the  control  population.  It  also  means  that  all 
the  bad  populations,  i.e.,  the  populations  with  significantly  different  covariance  structure  from  the 
control  population  are  eliminated  from  our  detection  algorithm  We  require  a  procedure  R  that  will 
satisfy  the  pre-determined  probability  requirement  P(CPI  R)  >  P*.  The  proposed  procedure  Rc  is  defined 
as  follows: 
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Procedure  Rc :  For  each  population 7C  j  (  i=  1,  2,  ....  k),  computer  Tt  =  x  Si  X  where  x  is  the  test 

cell  random  vector  and  Sj-  is  the  sample  covariance  matrix  of  the  population  71  ,• .  Partition  the  set  of 
the  populations  Q.  =  {Ki,K 2, k}  ’nt0  two  subsets  SG  and  SB  .  The  subset  SG  consists  of  those 
populations  71  j  with  c  <Tj  <  d  where  c  and  d  are  chosen  so  that  the  probability  requirement  P(CP)  > 
P*  is  satisfied. 

The  selection  statistic  7]  =  x  Si  X  in  procedure  Rc  is  in  the  same  form  as  the  test  statistic  used  in 
Chen  (1994).  This  seems  to  be  a  reasonable  choice  in  estimating  any  scalar  function  of  the  matrix 

S02;  1  when  there  is  only  one  observation  available  in  the  control  population 71  q  .  If  we  consider  S[  * 

as  the  reciprocal  of  the  covariance  matrix,  then  the  measure  used  in  the  selection  procedure  can  be 
thought  of  as  a  ratio  in  matrix  form. 


5.  THE  PERFORMANCE  MEASUREMENT  OF  THE  PARTITIONING  PROCEDURE  Rc  —  THE 
PROBABILITY  OF  A  CORRECT  PARTITION 

In  ranking  and  selection  theory,  we  use  the  probability  of  a  correct  selection  to  measure  the 
performance  of  a  selection  procedure.  In  this  report,  our  goal  is  to  partition  the  populations  according  to 
their  covariance  structures.  A  natural  measurement  of  the  performance  of  the  proposed  procedure  is  the 
probability  of  a  correct  partition  (PCP).  In  the  following,  we  first  derive  the  least  favorable 
configuration  (LFC)  based  on  which  we  will  computer  P(CP).  Then  we’ll  write  the  minimum  of 
P(CPILFC)  in  terms  of  the  multivariate  normal  density  function  and  the  Chi-Square  density  function.  In 
order  to  implement  the  proposed  procedure,  we  need  to  solve  numerically  the  procedure  parameters  c 
and  d  from  the  integral  equation  minP(CP)  =  P*. 

Assume  that  there  are  kj  +  k2  bad  populations.  Among  them,  k 7  populations  7t  j ,7t 2  , . . . ,  7E  £  1 

are  significantly  better  than  the  control  and  k2  populations  ^  k-kj+l’^  k~k2+2^--^  k  310 

significantly  worse  than  the  control.  That  is,  for  7  =  1,2, . . . ,  *j ,  the  largest  eigenvalue  A  of  the  matrix 

<5 1 .  For  1  =  k  —  k,2  + 1  ,  the  largest  eigenvalue  A,-  of  the  matrix  SqS(-  1  >82- 

As  a  consequence,  there  are  k-  kj  -  k2  good  populations.  We  also  assume  that  each  population  has 
sample  size  n. 

Then 


(5.1)  P(CP)  =  P(  all  the  bad  populations  are  eliminated) 

=  P(  Ti  <  c,i  =  1,2,...,  A, ;  Tj  >d,  j  =  k -k2+l,...,k) 

,/I-p  +  l.  T:  .77— 0  +  1.  C  .  , 

=  P{( - £ - )-*-<( - ^ - )— ,1  =  1,2,...,*!; 

O  A;  O  A; 


.77—0  +  1.  .77-0  +  1.  d  .  .  .  ,  . 

( - )tL>( - )— ,j  =  k-k2  +  l,...,k  }. 

P  Ay'  P  Ay 

Tt  ^  j  yi 

We  obtain  from  Chen  (1994),  page  7-8,  the  random  variable  ( - =  1,2,...,*  follows  a 

P  A; 

F  distribution  with  2 p  and  2(n-p+l)  degrees  of  freedom.  Thus,  the  above  expression  can  be  rewritten  as 
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(5.2)  P(CP)  =  P{  fi  <  (”  P  +  l) , i  =  1,2, . . . , kx ; 

P 


-  ,n-p  +  l.  d  .  , 

/;  >( - - - )t  >J  =  k 

P 


■  k2  +  !,•••>& }, 


where  fs  are  correlated  F  random  variables.  It  is  clear  from  (5.2)  that  P(CP)  attains  its  minimum  when 
the  parameters  X^  s,  i  =  1,2,...,  A:  j  reach  their  maximum,  all  at  6  j  and  the 
parameters  X  j  ’s,  j  =  k  —  k2  +  1.  •  •  • .  k  reach  their  minimum,  all  at  8  2  •  Thus  we  can  write 


(5.3)  P(CP) > P{  fi  <  (—  +  = 

P  8! 


-  ,n-p  + 1.  d  .  .  .  .  , 

/,  >< - >— , / = * — *2 + 1,...,* ). 

p  s2 

It  is  clear  from  the  above  expression  that  P(CP)  is  minimum  when  the  number  of  good  populations,  k  - 
kj  -  k2  ,  is  0.  Thus  we  have  shown  the  following  theorem. 

THEOREM  5.1:  The  least  favorable  configuration  for  any  parameter  vector  (Xi,...,X/c)  under 
procedure  Rc  is  given  by 

(5.4)  Xl  =  X2  =•••=  Xm  =  8 j  <82  =  Xm+\  =  ^m+ 2  =•••-  ^k 

where  m  is  an  integer  between  0  and  k  that  minimizes  the  probability  of  a  correct  partition  given  at  the 
right  hand  side  of  the  inequality  in  (5.3). 

From  the  above  theorem,  we  can  write  the  minimum  of  P(CP)  as 

(5.5)  minP(CP)>P{/t-  <  P+  =  1,2, ...,m ; 

P  8! 


.  . n-p  +  l .  d  .  .  . 

fj  >( - )—,j  =  m  +  l,...,k  }, 

P  S2 

where  the  minimum  is  over  all  the  parameter  vectors  (X  j X^  )  . 


In  order  to  implement  the  proposed  procedure  Rc  ,  we  need  to  find  the  procedure  parameters  c  and 

d  for  given  n,  p,  k,  o  \ ,  and  0  2  •  Although  (5.5)  gives  us  a  simple  expression  of  minP(CP),  it  is  not  easy 
to  computer  the  probability  since  it  involves  correlated  F  distributions.  In  the  following,  we  will  use 
conditioning  arguments  to  rewrite  mmP(CP)  in  terms  of  multivariate  normal  density  function  and  Chi- 
Square  density  function. 

We  shall  obtain  the  same  minimum  P(CP)  as  expressed  in  (5.5)  if  the  original  control  population 
has  covariance  matrix  =  I  >  the  identity  matrix  and  the  population  covariance  matrix 
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I;  = 


4r  0  •  •  • 

St 

0  e\  0  .  0 

e\  0  0 


0  • 

-4  0 

to 


for  i  -  the  population  covariance  matrix 


0 


0 


e2 


.  .  ei 

0  .  0 
e2  0  O' 

•  ■  *2  J 


for  i  =  m+  l,m  +  2,...,k ,  where  ej  and  e2  are  any  numbers 


*  _  * 

larger  than  (1  /  0  \ )  and  (1  /  0  2  )  respectively.  Therefore  we  have,  from  (5.1)  and  Theorem  5.1, 
(5.6)  P(CPILFC)  =  ?(xHS;lx  <  c,i  =  1,2  ,...,m;xHS:lx  >d,j  =  m+ 1 


where  jc  is  a  p-variate  complex  normal  random  vector  CN(0,I)  and  S(  is  the  sample  covariance  matrix 
with  covariance  matrix  £,■  and  Sj  is  the  sample  covariance  matrix  with  covariance  matrix  Zj  as  defined 
above.  Define  X  =  (xi,X2,.-,Xp)  where  x(- ’s  are  i.i.d.  univariate  complex  CN(0,1).  From  Rao 

(1973),  the  statistic  Wi  =  LH'ZJlL/  LH L  follows  a  Chi-Square  distribution  with  2(n-p-l) 

H  —1  H  — 1 

degrees  of  freedom  for  any  fixed  vector  L.  Thus  the  statistic  L  Si  L  =  (L  'Ll  L )  /  W}  follows 
the  reciprocal  of  a  Chi-Square  random  variable  times  a  constant.  Thus  from  (5.6),  we  have 

(5.7)  P(CPILFC)=P  {(xHLYlx/xHSflx)>(xHLJlx)/c,i  =  1,2 

(xHL~jlx/  xHSjlx)  <  (xHLJlx)/d,j  =  m+l,m+2,...,k} 

=  j\-\m  >  (8* /c)xlxi  +(U cei)[x2X2+...+xpxp],i  =  1,2,.. .,m; 

Wj  <  (8 2  /d)xlxl+(l/dei)[x2X2+...+xpxp],j  =  m  +  l,...,k}<k(xl,x2,..;Xp)dxldx2  •■■dxp 


where  Wi,i  =  l,...,m  and  Wj ,  j  =  m+  l,...,k  are  i.i.d.  Chi-Square  random  variables  with  2(n-p-l) 
degrees  of  freedom  for  given  x,  Xj,i  =  1,2,...,/?  are  the  respective  conjugates  of  Xj ,  x2 , . . . ,  xp ,  and 
(j)(x^  ,X2  ,...,xp  )  is  the  probability  density  function  of  a  p-varaiate  CN(0,I).  Consider  the  eigenvalues 
gjand  e2  for  the  two  covariance  matrices  Ej  and  L  j  as  parameters  subject  to  change  for  minimizing 
P(CP).  Then  from  the  last  expression  in  (5.7),  the  minimum  of  P(CPILFC)  is  obtained  when  e\  = 
(1  /  8 1 )  and  e2  =  (1/82)-  Substitute  e\  =  (1  /  8 1 )  and  $2  =  (1  /8  2)  in  (5-7),  we  obtain 


(5.8)  mmP(CPILFC)  =  J  J-  •  -J  P{1^-  >  (8  *  /  c)[Xj  Xj  +  x2X2+...+xpxp],i  =  l,2,...,/n; 
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sfe 

wj  I d)[xxx i  +x2X2+...+xpxp],j  =  m  +  l,...,k}$(xi,X2,...,xp)dxidx2  ■  dxp 

The  above  probability  depends  on  Xj , x2 xp  only  through  XjXj  +  x2x2+.--+xpxp  which  has 
a  Chi-Square  distribution  with  2p  degrees  of  freedom.  Thus  we  can  write 

(5.9)  mmP(CPILFC)  =  J(%2(5^  /  d))k~m(l- X2(6^  /  c))mf(y)dy 

0 

2 

where  %  is  the  distribution  function  of  a  Chi-Square  random  variable  with  2(n-p-l)  degrees  of 

freedom  and_/fy)  is  the  probability  density  function  of  a  Chi-Square  random  variable  with  2p  degrees  of 
freedom. 

6.  CONCLUDING  REMARKS  AND  DIRECTIONS  FOR  FUTURE  RESEARCH 

The  limitation  on  the  sample  size  prohibits  us  to  get  a  good  estimate  for  the  covariance  structure 
of  the  control  (i.  e.,  the  test  cell  in  radar  application).  Since  the  statistic  7J  =  xSf^x  intuitively  gives 
us  more  information  about  the  deviation  of  the  covariance  matrix  £,•  from  the  covariance  matrix  Xq 
than  any  other  statistic,  it  was  used  in  Chen  (1994)  to  develop  the  testing  procedure  for  the  homogeneity 
of  the  covariance  matrix  of  the  test  cell  and  the  covariance  matrix  of  the  reference  cells.  The  underlying 
distribution  for  both  the  test  data  and  the  reference  data  in  Chen  (1994)  is  assumed  to  have  0  mean.  As  a 
matter  of  fact,  the  second  order  statistic  (therefore,  the  covariance  matrix)  is  not  even  estimable  if  one 
has  only  one  observation  in  the  test  data  when  the  first  order  statistic  is  not  known. 

This  report  suggests,  as  those  multivariate  selection  procedures  reviewed  in  Section  2,  the  same 

statistic  7J  =  xS{  *x  used  in  testing  be  used  in  partitioning  the  secondary  data.  It  shows  that,  as  in  any 
typical  selection  problem,  the  distribution  theory  studied  for  the  purpose  of  hypothesis  testing  (in  our 
case,  the  test  statistic  T  in  Chen  (1994))  could  be  very  useful  here  in  partitioning  theory.  The  main 
results  we  obtain  in  this  paper  are  (1)  a  new  selection  formulation  for  partitioning  the  secondary  data  in 
radar  signal  processing;  (2)  an  inferential  selection  procedure  to  achieve  the  goal  of  partitioning  the 
secondary  data;  and  (3)  the  formulas  for  the  performance  measurement,  P(CP)  for  the  proposed 
procedure.  For  the  future  study  of  this  problem,  we  shall  divide  the  research  into  the  following  three 
phases. 

Phase  1:  Computing  the  procedure  parameters  c  and  d. 

We  plan  to  write  a  computing  algorithm  to  calculate  the  probability  in  formula  (5.9)  for  general  n,  p, 
L 8 j ,  and  82-  Then  by  using  the  computing  algorithm,  we  shall  be  able  to  tabulate  the  needed 

_  *  >J* 

procedure  parameters  c  and  d  for  selected  combinations  of  n,  p,  k,  8  j ,  and  8  2  •  A  numerical  examples 
should  be  given  to  illustrate  the  proposed  procedure  by  using  the  table  of  c  and  d.  The  work  requires  a 
numerical  integration  of  a  small-valued  function  which  is  the  product  of  Chi-square  distribution 
functions  and  probability  density  function.  It  also  involves  searching  for  the  appropriate  solutions  for 
different  P*  requirements. 

Phase  2:  Simulating  P(CP),  E(S),  E(G),  and  E(B). 

After  the  table  of  c  and  d  is  computed,  simulation  work  should  be  conducted  to  confirm  our  theoretical 
results  on  minP(CP)  and  the  computing  results  on  c  and  d  obtained  in  Phase  1.  At  the  same  time,  we 
need  to  include  in  our  simulation  yet  another  performance  measurement  of  the  proposed  procedure,  the 
expected  size  of  the  subset  that  are  partitioned  as  the  ‘good’  reference  data.  The  expected  subset  size, 
E(S),  is  often  used  in  subset  selection  approach  as  a  criterion  to  evaluate  a  selection  procedure.  The 
smaller  the  expected  subset  size  is,  the  better  the  selection  procedure  is.  Since  our  goal  is  to  screen  out 
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those  cells  with  significantly  different  covariance  structures  and  simultaneously  keep  those  reference 
cells  which  have  the  same  covariance  structure  as  the  test  cell  so  that  a  follow-up  study  on  signal 
detection  can  be  conducted,  the  expected  number  of  bad  cells,  E(B)  and  the  expected  number  of  good 
cells,  E(G)  in  the  subset  partitioned  as  the  ‘good’  subset  could  also  be  used  as  possible  performance 
measurements.  The  above  four  measures,  P(CP),  E(S),  E(G),  and  E(B)  can  be  simulated  in  the 
following  manner. 

(1)  For  given  k,  and  unequal  covariance  matrices  L;  (  i  =  1,  k),  we  generate  n  complex 

multivariate  normal  random  vectors.  We  also  generate  one  complex  multivariate  random  vector  for  the 
control  test  cell  with  covariance  matrix  Eq  . 

(2)  From  the  table  in  Phase  1,  we  obtain  the  appropriate  c  and  d  values  for  given  k  and  P*. 
Calculate  the  statistic  T  ;  for  each  of  the  population  ft;  ( i  =  1,  2, ...,  k). 

(3)  Use  Procedure  R  c  to  partition  the  populations. 

(4)  Repeat  (1)  to  (3)  10000  times  and  estimate  P(CP),  E(S),  E(G),  and  E(B)  respectively  by 
P(CP)  =  #  of  times  the  selected  subset  contains  all  the  ‘good’  populations/10000; 

E(S)  =  sum  of  the  subset  sizes  of  the  subsets  partitioned  as  the  ‘good’  populations/10000; 

E(G)  =  total  #  of  the  true  good  populations  in  the  subsets  partitioned  as  ‘good’/10000; 

E(B)  =  total  #  of  the  true  bad  populations  in  the  subsets  partitioned  as  ‘good’/lOOOO. 

Phase  3:  Evaluating  the  robustness  of  the  partition  procedure  in  radar  applications. 

This  is  the  phase  where  we  will  be  making  assessments  on  the  theoretical  selection  theory  that  we 
propose  in  this  report  in  radar  signal  processing  applications.  From  the  theoretical  derivation  of  P(CP)  in 
Section  5,  we  can  find  the  procedure  parameters  c  and  d  so  that  with  high  probability  P*,  we  are  making 
a  correct  partition  no  matter  what  the  real  situation  is.  If  a  radar  signal  detection  test  designed  for  the 
homogenous  data  is  applied  to  the  good  populations  which  have  been  identified  by  our  partition 
procedure,  it  is  reasonable  to  assume  that  we  will  be  getting  at  least  as  good  results  as  the  original  data. 
But  whether  the  improvement  on  the  performance  measures  in  radar  system  is  significant  for  us  to 
develop  the  partition  procedure  is  an  important  issue.  We  first  need  to  identify  which  standard  procedure 
in  radar  signal  detection  problem  to  be  used  in  evaluating  our  partition  theory.  We  then  need  to  define 
the  criteria  (or  the  performance  measure)  that  we  should  use  to  assess  our  proposed  study. 

In  radar  detection,  Kelly  (1986)  studied  a  generalized  likelihood  ratio  (GLR)  test  to  test  a  given 
signal.  His  test  has  been  used  in  many  articles  (see  for  examples,  Cai  and  Wang  (1990),  Raghavan,  Qiu, 
and  McLaughlin  (1995),  and  Wang  and  Cai  (1990))  as  a  standard  in  comparing  new  methodologies  in 
signal  detection.  We  suggest  that  Kelly’s  procedure  being  used  as  the  signal  detection  procedure  to 
evaluate  our  selection.  The  general  performance  measures  in  radar  detection  theory  are  the  probability 
of  false  alarms  P(FA)  and  the  probability  of  detection  P(D).  We  suggest  that  these  two  measures  being 
used  as  our  comparing  criteria  for  the  data  before  selection  and  after  selection.  The  following  simulation 
study  is  proposed: 

(1)  Simulate  multivariate  complex  normal  data  with  k  covariance  structures  for  the  reference 
data.  Simulate  a  multivariate  complex  normal  data  for  the  test  data. 

(2)  Apply  GLR  test  for  given  P(FA)  on  the  data  obtained  in  (1)  and  record  a  detection  or  not 
detection. 

(3)  Set  the  probability  requirement  P*  and  choose  the  appropriate  procedure  parameters  c  and  d 
from  the  table  obtained  in  Phase  1.  Apply  the  proposed  partition  procedure  R  c  to  the  data  obtained  in 
(1).  We  will  obtain  a  set  of  ‘good’  populations. 

(4)  Apply  the  GLR  test  for  the  same  P(FA)  on  the  ‘good’  populations  and  record  a  detection  or 
not  detection. 

(5)  Repeat  (1)  to  (5)  10000  times  and  calculate  P(DI without  partition  procedure)  and  P(Dlwith 
partition  procedure)  respectively  as  follows. 

P(Dlwithout  partition  procedure)  =  total  number  of  detection  in  (2)/10000; 

P(Dlwith  partition  procedure)  =  total  number  of  detection  in  (4)/10000. 

(6)  Use  the  estimate  obtained  in  (5)  to  assess  our  partition  procedure. 
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ABSTRACT 

In  this  paper  the  analysis  of  variance  (ANOVA)  is  extended  to  the  representation  of  multi¬ 
frame  images.  The  methodology  of  the  optimal  detection  of  moving  objects  embedded  in  noisy 
environments  is  presented,  yielding  ultimately  a  closed  form  solution.  The  new  detector  makes  use 
of  the  information  of  the  spatial  and  temporal  effects,  and  is  uniformly  most  powerful  (UMP)  in 
Gaussian  environment  with  unknown  and  time-varying  noise  variance. 

The  new  detector  is  primarily  an  optical-flow  based  detector  without,  however,  the  drawbacks 
of  the  latter;  it  uses  advantageously  a  feature-based  approach  to  improve  the  efficiency  of 
computation.  The  versatility  of  the  proposed  detector  is  demonstrated  by  the  fact  that,  with  an 
adjustment  of  system  parameters,  it  is  applicable  to  multi-spectral  detection  and  stationary-objects 
detection.  It  is  also  shown  that  all  well-known  detectors,  generally  classified  as  feature-based  or 
optical-flow-based,  become  subclasses  of  the  new  detector  if  the  proposed  detector  is  subjected  to 
more  restrictive  conditions. 

Performance  of  the  proposed  detector  is  fully  supported  by  extensive  real  image  simulations. 
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AUTOMATIC  MOVING  TARGETS  DETECTION  AND  ESTIMATION 


USING  CONTAMINATED  DATA 
Julian  Cheung 


1.  INTRODUCTION 

The  detection  of  moving  objects  using  a  sequence  of  time  frames  has  been  of  major  concern 
throughout  the  evolution  of  computer  vision,  because  of  its  broad  spectrum  of  applications,  i.e., 
biomedicine,  tactical  and  strategic  military  applications,  industrial  automation,  meteorology,  and  data 
compression. 

Because  of  these  efforts,  a  multitude  of  moving-object  detection  algorithms  have  been 
developed.  Most  of  them  can  be  categorized  into  two  classes.  The  first  class,  generally  based  on 
feature-based  techniques,  involves  the  extraction  of  a  set  of  relatively  sparse  (less  than  10%),  but 
highly  discriminatory  tokens  such  as  edges,  comers,  and  sharp  changes  in  curvature  characterizing 
objects.  Moving  objects  detection  problem  is  thus  reduced  to  the  correspondence  of  the  tokens 
between  the  consecutive  frames  [l]-[4].  The  second  class  is  based  on  computing  the  optical  flow, 
i.e.,  the  2-dimensional  (2-D)  field  of  instantaneous  velocities  of  gray  levels  at  every  pixel  in  the  image. 
By  the  rigidity  constraint  that  all  pixels  of  an  object  move  with  the  same  velocity,  the  optical  flow 
segments  the  image  scene  into  distinct  regions  associated  with  the  moving  objects.  Their 
identification  is  achieved  using  some  similarity  measures  [5]-[8], 

The  principal  problem  with  the  first  approach  is  that  it  depends  on  the  token  attribute  which 
is  highly  affected  by  the  modeling  of  multi-frame  images.  Also,  the  observed  image  tokens  may  not 
correspond  to  the  object  tokens,  probably  because  of  segmentation  errors  or  occlusion  caused  by 
components  of  the  background  scene.  Furthermore,  establishing  correspondence  between  sets  of 
tokens  between  frames  is  not  a  trivial  matter. 

The  main  drawback  of  the  second  method  is  that  since  derivative  operation  accentuates  noise, 
optical  flow  technique  is  highly  susceptible  to  noise  influence.  As  is  well-known,  in  many 
circumstances  image  flow  does  not  correspond  to  optical  flow,  the  common  practice  of  computing 
image  flow  (from  data)  and  treat  it  as  the  optical  flow  then  becomes  questionable.  Although  to  a 
lesser  extent,  occulatory  effects  also  may  play  havoc  with  object  detection. 

As  is  valid  in  almost  all  computer  vision  problems,  an  over-simplistic  or,  worse  of  all, 
improper  modeling,  of  an  image  when  applied  to  any  of  the  approaches  described  above  may  result 
in  that  “we  often  assume  that  the  scene  is  composed  of  surfaces  each  of  which  has  a  simple  shape  - 
in  a  simple  way.  -  As  a  result,  standard  vision  techniques  often  do  not  perform  very  well  when 
applied  to  natural  scenes”  [9],  Also,  most  methods  are  primarily  developed  for  cases  that  involve 
either  low  noise  or  noise-free  images.  These  idealized  conditions  are  increasingly  invalid  in  the 
present  era  of  wide  usage  of  the  information  super-highway.  This  is  a  handicap  in  itself  due  to  a 
rather  high  number  of  false  alarms  that  results  in  applying  such  techniques  in  any  actual  detection 
problems. 

This  motivates  the  present  research,  which  is  directed  toward  the  development  of  a  moving 
object  detector  that  is  uniformly  most  powerful  (UMP)  in  Gaussian  noise  environments  of  unknown 
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and  varying  variance  noise.  The  paper  is  organized  as  follows. 

In  section  II,  we  model  a  scene  pixel  as  contributed  by  the  effects  of  rows,  columns, 
diagonals,  and  frames,  thus  permitting  all  local  pixels  to  have  a  reasonably  large  variation  in  gray 
levels.  These  effects  are  then  estimated  from  data.  In  section  III,  the  utilization  of  local  contrasts 
(per  frame),  as  applied  to  multi-frame  images  in  edge  detection  is  derived.  This  ultimately  leads  to 
a  moving-edge  detector  which,  as  shown  in  Section  IV,  is  based  on  the  difference  of  local  contrasts 
between  two  frames.  In  section  V,  the  optimal  detection  of  moving  objects,  based  on  the  generalized 
likelihood-ratio  test  (GLRT),  is  derived.  The  duality  of  the  GLRT  and  the  multi-comparison  test  is 
subsequently  established,  and  the  generality  of  the  proposed  detector  is  explained.  An  implementation 
algorithm  and  computer  simulation  results  based  on  real  noisy  images  are  considered  in  section  VI. 
Further  discussions  and  suggestions  are  presented  in  section  VII. 


II.  THREE-D  GRAECO-LATIN  SQUARE  DESIGN  FOR  DYNAMIC  SCENES 
A  Graeco-Latin  square  (GLS)  model  is  a  four-way  design  which  contains  four  treatments  groups  with  I 
levels  each.  Consider  the  layout  of  the  treatments  in  the  first  frame  of  a  sequence  of  dynamic  scenes, 
i.e.,  m  - 1  of  a  /x/,  7=5,  GLS  design  illustrated  in  Fig.  1 .  In  this  design,  there  are  four  treatment 
groups:  the  rows,  the  columns,  the  Tau's  and  the  Lambda's.  The  Tau's  (Lambda's)  are  arranged  in 
a  cyclic  row  (column)  pattern  representing  45°  (135°)  diagonal  effects.  The  treatments  are 
orthogonal  in  the  sense  that  all  treatments  appear  together  only  once  in  the  entire  design  [10],  Let 
{ a1;»  }/=r  *  i  {Ah}{=i  be  the  row,  column,  45°  diagonal,  135°  diagonal  effects, 

respectively,  in  frame  1 .  Each  pixel  value  is  represented  by 

yvjj  =  p  +  +  Yj  ,j  +  +  Xvl  +  e1Uj.,  ij,k,l  e  D] ,  and  Dx  is  the  set  of  1 2  observations  in  frame 

1 .  Here  p  denotes  the  grand  mean  of  all  frames,  the  mean  effect  of  frame  1 ;  eVjJ.  is  the 
uncertainty  (noise)  associated  with  the  ith  row,  yth  column  observation  that  is  Gaussian  distributed 
with  mean  0,  variance  a2,  and  is  pair-wise  statistically  independent  on  the  same  frame. 

A  three-dimensional  (3-D)  GLS  design  is  an  outgrowth  of  the  regular  GLS  design  with  the 
additional  dimension  pertains  to  the  frame  number  among  a  sequence  of  registered  scenes.  The 
parametric  model  of  a  multi-frame  scene  becomes 

Q:  ym-jj  =  \i+L  +  %  +  Ym-J  +  xmjc  +  Km  +  W=1>”>M  (21) 

Here  Dm  is  the  set  of  I2  observations  corresponding  to  /nth  frame.  Define  D  =  {DX,  -,Dm },  then  D 

is  a  collection  of  M  frames  of  observations.  Of  course,  the  orthogonality  of  the  treatments  extends 
readily  to  the  collection  of  frames. 

Since  the  parameters  p ,  { Cm ) ,  { am.(. },  { y  },  { xm;k },  { Xml }  are  generally  unknown,  they 
are  estimated  from  the  data.  This  is  attained  by  forming  the  sum  of  squares 

5  =  E  {jw  -  m  -  Cm  -  <v,  -  ym,  -  -  K-I  f  ■  (2-2) 

e  D 

Taking  the  derivative  of  S  with  respect  to  p  and  equating  it  to  zero  to  obtain 
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Mil 


mi2  v-  =  E  E  E>w  -  /2E  Cm  -  7E  E  -^EEy^^EE^^EE  *m;/>  ™th 

m  =  l  i=l  y=l  m  mi  m  j  m  k  ml 

the  resulting  least-squares  estimate  of  p  as 

p  =  p  =>>.  -  C  ~  -  Y  ;  - 1 .  -  A  .  .  (2.3) 

The  convenient  dot  notation  indicates  that  whenever  a  subscripted  variable  is  replaced  by  a 
dot  it  means  that  unweighted  averaging  operation  has  been  taken  place  over  the  affected  variable,  i.e., 
v  .  =  /_1E.  v  v  =/_1  E.  v  . ,  v  =M'1E„  v  .  ,  C  =  A/"1  £  £  ,  a  =/‘1X).  a  ., 

a.  =M_1  Em  a  ,  etc. 

Analogously,  taking  the  derivative  of  5  with  respect  to  Cm,  aw;/,  Ymy>  setting  each  of  them 
to  zero  and  recognizing  that  for  fixed  m  and  / ,  ^0MeDm  xm-jc  =  K  Tm*>  ^ujc,i)eDm  Kj  =  K  Am;/ while 
for  fixed  m  and  j,  Z(iXl)eDm  =  £*  t^,  E^^  Am;i  =  E,  Aw;/  (see  Fig.  1),  yields 

Cm  =  C„  =  -  F  -  «m,  -  Ym,  -  V  -  Am;  (2-4) 

a  .  =  a  .  =  y  .  -  p  -  £  -  Y  ~  t  “A  (2.5) 

y  =  y  =v  .  —  w  —  C  -  a  -t  -A  (2.6) 

•  my  ‘my  -Km;.;  r*  ’m  m;.  m;.  m;. 

respectively. 

By  the  same  token,  upon  taking  the  derivative  of  S  with  respect  to  x^,  Am;Z,  setting  each 
of  them  to  zero,  and  simplifying,  yields 

Xmjc  =  Xmjc  =  /_1  EJ=1  y m\ij  "  F  -  Cm  “  «M,  "  Y„,  "  V  (2J) 

Am;/  =  K-l  =r'Y!i=l  ymj]  ~  F  ~  Cm  -  «rn;.  "  Y  m,  "  V  >  (2’8) 

where  j  equals  k+\-i  if  A+l>/,  it  equals  k+l-i+I  otherwise;  also  j  equals  i+l-l  if  i+l-\<.I,j 
equals  /+/-1-7  otherwise. 

According  to  (2. 1),  ctm.i  being  the  row  treatment  effect  is  independent  of  other  directional 

effects.  This  is  also  valid  for  its  least-squares  estimate  am.i  and  has  the  implication  that  the  last  three 

terms  in  (2.5)  vanish.  The  same  reasoning  leads  to  the  elimination  of  the  last  three  terms  in  (2.6-8). 
Furthermore,  p  is  the  grand  mean  of  all  the  available  frames  and  it  is  independent  of  { £  },  { a  .  }, 

{ y .  },  (t.  },  (A.  }.  Therefore,  the  term  £  disappears.  In  brief,  in  the  3-D  GLS  design  a  set  of 

natural  side  conditions  are 

M 

c„  =  o 

m-l 

»=i  /=i 

V  =rl  E  xm*  =  °>  K,  E  K-i  =  °>  m=1>  ~>M- 

k= i  ;=i 

Therefore,  the  least-squares  estimates  (2.3)-(2.8)  can  be  simplified  as 
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(2.10-15) 


A  =y,„.  >  L  =ym,.  -y,. 

® m,i  ~  y m;i.  T m;..  >  Y  m-j  ~  y m;.;  -P m;.. 

5Z/=1  -P m;iy  m;..  ’  ~  ^  =  1  -l" m;y  m;.. 

respectively,  and  j  ,  j  have  been  defined  before. 

For  subsequent  analysis,  introduce  a  p  x  1  parameter  vector  P  =  [  g  C  n  Y  t  If, 
/>=1+M+4M7,  with  the  sub-parameter  vectors  CMxl  =  [  Cx  -  C^]7, 

«^)X1  =  [  «l;l  -  « W «M1  -  ]'  >  Y(M/)X1  =  [  Y1;1  -  YUJ  y  >  ^/)X1  =  [  Tl;l  -  V  ^  > 

A(M°xl  =  [  Aj.j  ••  A .M.j  ]'.  This  permits  a  compact  representation  of  the  parametric  model  (2. 1)  as 


Q: 


yn*l  _  p/?xl  +  e- 


nxl 


(2.16) 


where  yml  =  [yvn  - yl;U yVJJ  -yVJJ yM.n  -  y  WJ  ]'  is  the  observation  vector  formed  by  the 


row  stacking  of  the  sequential  frames,  n  =MI2,  J  -I,  and  e”xl  is  the  corresponding  noise  vector 
with  the  distribution  N(0,  a2/).  In  the  n*p  four- way  effect  matrix  X',  each  element  equals  1  or  0 
depending  on  whether  the  associated  level  of  a  particular  treatment  group  is  present  or  not. 

For  any  3-D  GLS  design,  X!  is  known  a  priori  while  P  may  be  estimated  by  the  least  squares 
approach.  However,  in  general  X'  is  not  of  full  rank.  In  fact,  the  n"p  effect  matrix  X'  has  the  rank 
r,  r=p  -  t,  p=l  +M+AMI,  t=  1  +  AM.  Consequently,  the  least-square  estimate  P  is  not  unique. 
This  poses  a  lot  of  theoretical  and  implementation  problems  because,  as  will  be  clear  later,  statistic 
for  moving  object  detection  is  associated  with  the  linear  transformation  of  the  parametric  vector  P 
which  is  supposedly  unique.  To  circumvent  this  dilemma,  we  consider 


Theorem  1  (Unique  parameter-vector  estimate)  Suppose  in  the  parametric  model 
yn xl  =  X' Ppxl  +  e  the  n*p  effect  matrix  X!  has  rank  r,  r<p<n.  By  imposing  the  regularity 
conditions  i)  a  set  of  t=p  -  r  linear  restrictions  on  P,  i  e.,  H'  P  =  0'xl,  where  H'  is  t*p  with 
rank  -  t,  and  ii)  no  rows  of  H'  is  a  linear  combination  of  rows  of  X!  except  0,  then  a  unique 
least-squares  estimate  of  P  is 

P  =  By,  B  =  (  XX'  +  HH)1  X  (2. 17) 

Proof:  Refer  to  [12], 


Corollary  1.1  Transferring  the  inverting  matrix  in  (2.17)  to  the  l.h.s.  and  utilizing  the  identity 
jfiT7  p  =  0  we  get  ( XX 1  +  HH')  P  =XX'  P  =Xy.  Replacing  y  by  (2.16)  and  upon  taking  the 

expectation,  then  XX' E  (P)  =  XE  (y)  =  XX'  P,  so  that  P  is  an  unbiased  estimate  of  P .  Since  for 
normal  distribution  least  squares  estimate,  mean  square  (minimum  variance)  estimate  and  maximum 
likelihood  estimate  coincide  [11],  P  is  the  best  linear  unbiased  estimate  (BLUE)  of  the  unknown  P . 


Corollary  1.2  The  least-squares  sum  can  be  simplified  to  Se  =  (y  -X'fy)'  (y  -X'fy)=y(y  -X'§). 
Applying  Theorem  1,  we  obtain  the  least  squares  sum 
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s,-y’Q'Qy.  Q-i-x'b 


(2.18) 


In  reality,  the  pixel  variance  a2  is  unknown.  However,  the  theorem  below  indicate  the 
relationship  between  Sg  and  the  estimate  62  of  o2 . 

Theorem  2  (unbiased  estimate  of  o2)  The  ratio  s2  -  Sei  (n-r)  is  the  BLUE  of  the  unknown 

variance  a2. 

Proof:  Refer  to  [12], 


m.  MULTI-FRAME  SEGMENTATION 

Ideally,  an  edge  can  be  defined  as  an  abrupt  transition  from  one  homogeneous  region,  in  terms  of  gray 
levels,  to  another  homogeneous  region.  This  step-like  distribution  of  an  edge  enables  its  detection 
using  derivative-based  local  operators,  including  Sobel,  Kirsch,  etc.  [13],  However,  the  available  data 
set  is  usually  corrupted  by  noise  which  results  in  blurred  edges  and  false  features.  Also,  an  edge  is 
sometimes  associated  with  a  line  transition  which  further  complicates  the  edge  detection  process.  As 
such,  it  is  advantageous  to  use  a  statistical  technique  that  would  provide  both  robustness  as  well  as 
discrimination  of  lines  in  the  course  of  detecting  possible  edges. 

For  ease  of  development,  we  will  focus  on  the  detection  of  vertical  edges.  Consider  a  5  x  5 
rectangular  mask  that  scans  the  first  five  rows  of  frame  1 ,  horizontally  and  starting  with  the  first  5 
columns,  then  the  next  5  columns,  and  so  forth.  When  it  hits  the  last  column  it  makes  a  zigzag  return 
and  repeats  the  same  procedure  on  the  next  5  rows,  etc.  Each  pixel's  intensity  is  due  to  a 
combination  of  various  effects  which  we  call  treatments.  In  a  5  x  5  region  each  treatment  has  only 
a  very  limited  number  of  values  which  we  call  levels.  This  suggest  the  GLS  design  (2. 1)  is  a  suitable 
model  for  the  representation  of  gray  levels  on  a  local  basis.  The  edges  of  frame  1  (taken  at  time  ) 


that  is  within  the  mask  shown  in  Fig.  1  can  be  expressed  using  the  following  contrast  functions 


Y,.i  - 

7^ 

,2  +  Y  ,;3  +  Y 1 

;4  + 

Y  1;5  ) 

i(Y, 

;1  +Y!;; 

!  )  ~  7  (  Yl;3 

+  Yi 

L;4  +  Y 

7<Y. 

;1  +Y  1;: 

l  +  Y  1;3  )  ~  7 

(Y, 

U4+Y 

;i  +Y1;: 

!+  YlS+Yl;. 

4)- 

Y  1;5 

(3.1) 


Under  no  noise  situations,  if  there  is  no  edge  (indicating  homogeneity),  each  of  the  contrast 
functions  equals  0 .  On  the  other  hand,  if  an  edge,  for  example,  exists  between  columns  2  and  3 , 
then  £j,2  attains  the  highest  amplitude  among  all  contrast  functions.  Therefore,  the  contrast  functions 

approach  results  in  enhancing  real  edges  and  suppressing  false  edges.  In  the  presence  of  noise,  the 
edge  detection  problem  can  be  expressed  as  a  pair  of  hypotheses  testing  Hq]:  £1;1  =  £1;2  =  -  =  £1;4 

against  H[e):  !(  £1;1  =  £1;2  =  -  =  S1;4).  Rewrite  £1;I.  as  £1;/  =  c1;.  QTl,  cw  =  [  c1;j7  -  cVJp  ],  to  explicitly 
indicate  that  every  contrast  function  is  a  linear  combination  of  the  parametric  vector  P  with  the 
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corresponding  contrast  coefficient  vector  (3.1).  For  example,  we  have  cV2 1+A/+/+1  =  cV2  i+m+i+2  = 
ci-2.i+A/^/+3  =  ■"  =ci-2itA/t2/-i  =  "1/3,  and  c^.  =  0  otherwise.  Define  the  contrast  vector 
=  [  5j.j  -  Sj.  ]7  and  the  contrast  coefficient  matrix  C1  =  [  c(;1  c[.2~  c[.q  ];,  q  =  4,  the  set  of 
contrast  functions  in  frame  1  is 

Sj  =  cfxp  p**1.  (3-2) 

The  edge  detection  problem  can  now  be  reformulated  as 

H[e)\  ^  *  0qxl  . 

As  discussed  before,  the  parameter  vector  p  is  unknown,  and  so  is  replaced  by  its  estimate 
.  Since  £x  =  Cx  P  is  a  linear  combination  of  P,  by  the  Gauss-Markov  theorem  [14,  pp.  14-15]  the 
BLUE  of  £x  is  £x  =  Cx  $  among  all  linear  estimates,  on  the  condition  that  0  is  the  BLUE  of  P  which, 

we  recall,  has  been  established  in  Theorem  1.  Consequently,  an  efficient  estimate  of  the  contrast 
vector  is 

£i  =  ci  By=Aiy,  (3.4) 

where  Ax  =  CXB,  and  B  is  defined  in  Theorem  1. 

Since  y  is  normally  distributed,  its  linear  transformation  ^  =Axy  is  also  normally  distributed, 

yielding  |x  ~  N(  o2E{  ) .  Applying  SchefFe's  multiple  comparison  theorem  [15],  for  any  £x  in  the 

q-D  contrast  space  spanned  by  the  basis  vectors  { •••,  )  its  BLUE  £x  falls  in  the  confidence 

ellipsoid 

*  V1  Fm„.r  (3.5) 

with  the  probability  1  -  a .  Here  q  =  rank  (£x)  =  4 ,  n  =  MI 2 ,  r  =  rank  ( X! ) . 

Returning  to  (3.3),  the  no  edge  phenomenon  is  identical  to  the  situation  that  every  contrast 
function  {$w}f=1  is  null.  Therefore,  every  contrast  function  in  the  q-D  contrast  space  vanishes. 

Under  H0  the  confidence  ellipsoid  becomes  Ex  £x  ^  qs2  Fa.qn_r.  Testing  statistic  for  (3.3),  at  the 
a -level  of  significance,  is 

K  I.  %  9S2F  r  (3-6) 

The  rejection  of  #0(e)  indicates  there  exists  an  edge  (at  least  one)  in  the  underlying  mask.  To 

locate  the  edge  we  may  choose  the  \Vi  corresponding  to  the  highest  absolute  value  among  all  q 
contrast  functions,  since  it  is  the  most  likely  candidate  responsible  for  the  rejection  of  H0 . 

The  edge  detector  above  can  only  detect  and  locate  vertical  edges  and  is  of  very  limited 
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practical  use.  We  thus  introduce  a  modified  edge  detector  ^  =  |^r);  ^  ),  sensitive  to  four 

major  directions  (in  frame  1).  By  using  the  information  in  £jr),  %f,  £jc),  ^ ,  it  detects  edge  features 
oriented  in  0°,  45°,  90°,  135°,  respectively.  Let  j  zj£j  |  /'=l,-,7;  be  elements  of  the  7t- 

directional  mask  corresponding  to  tt  =  r,d,c,s  and  is  associated  with  the  physical  data 
iylw  I  /= W;  j=  h-M  by 

(r)  _  (d)  _ 

zi  ;y">V  (3.7-10) 

(c)  (s) 

ziw=y^  zvj=ywv-i  ■ 

In  short,  the  theory  of  edge  detection,  discussed  hitherto,  can  be  applied  to  7t  -directional  detection, 
with  iyVij )  replaced  by  { zfij } .  The  foregoing  procedure  is  applied  analogously  to  the  detection  of 
edges  in  frame  m  (taken  at  tm)  with,  however,  the  subscript  1  replaced  by  -,M . 


TV.  DETECTION  OF  MOVING  EDGES 

Consider  a  rigid  body  moving  in  the  easterly  direction  at  a  rate  slower  than  the  shuttering  speed  of 
a  camera  mounted  on  a  stationary  platform.  Without  loss  of  generality,  assume  the  body  is  larger 
than  the  processing  mask  (typically  5x5).  If  it  is  not,  one  may  decrease  the  mask  size  and  increase 
the  FAR  for  a  satisfactory  power  of  detection.  Suppose  at  time  tx  a  region  (  91  hereafter)  that  covers 
the  object  including  its  leftmost  boundary  and  occupies  the  mask  from  column  3  onward.  From  the 
point  of  view  of  the  mask,  pixels  between  columns  3  and  5  belong  to  the  object,  while  the  area  from 
columns  1  to  2  belong  to  the  background.  The  change  of  gray  intensity  between  columns  2  and  3 
constitutes  an  edge  and  is  a  subset  of  the  outermost  contour  of  the  object. 

Suppose  at  t2  the  object  moves  eastward  by  one  pixel.  The  rigidity  of  the  object  mandates 

that  all  pixels  of  the  object  move  by  the  same  amount  and  in  the  same  direction,  implying  that  91  now 
fills  columns  4  to  5 .  In  brief,  object  movement  translates  the  vertical  edge  from  column  3  to  column 
4  in  the  time  interval  (tv  t2\ .  On  the  other  hand,  if  the  object  is  stationary,  its  vertical  edge  stays  the 

same  from  one  mask  to  the  next.  This  suggests  the  detection  of  edge  movement  is  crucial  to  the 
discrimination  of  dynamic  objects  from  stationary  counterparts.  The  mechanism  is  consistent  with 
the  operation  of  the  human  visual  system  (HVS),  which  differentiates  a  moving  object  from  a 
stationary  object  by  comparing  whether  the  contour  (possibly  closed)  of  the  underlying  object  moves 
or  not  relative  to  a  human  observer  from  one  frame  to  the  next  (tx  to  t2).  The  detection  of  vertical- 

edge  movement  may  thus  be  described  verbally  as  H^:  no  vertical-edge  movement  within  masks 

of  two  successive  frames  on  the  same  neighborhood  versus  Hxm^:  at  least  one  vertical  edge  moves 


in  the  masks. 

Introduce  a  set  of  composite  contrast  functions 

t  =  £i rke  *'=i> 


(4.1) 


6-9 


With  E,mi  defined  in  (3. 1),  m  =1,2,  then  t|r(.  =  dt  p  and  so  every  composite  contrast  function  is  a  linear 
combination  of  the  parameter  vector  p  with  the  corresponding  composite  contrast  coefficient  vector 

di  =  evj  ~  C2,i  =  t  du  •  diP  L  dij  =  ci;iJ  ~  c2jj>  J  =  l>  ->P-  Applying  (3.1)  to  get  £f=1  d  =0  and,  in 
conjunction  with  the  linear  independence  of  i|r  ,  i|r  ix  *  i2,  every  composite  contrast  function  is  a 

linear  contrast  function  [14].  Thust|fp  t]r^  are  the  basis  vectors  spanning  the  q-D  composite 
contrast  space  a>cc  and,  in  particular,  the  composite  contrast  vector  i|r ,  i|r  =  [  ]' ,  belongs  to 

cocc.  Rewrite  i|r  as 


t|r  =  Z)p,  (4.2) 

where  Dq*p  =  [  d[  -  d!q  ]'  is  the  corresponding  composite  contrast  coefficient  matrix.  Applying 

Theorem  1  to  obtain  the  unbiased  estimate  tjr  of  t]r,  we  have 

ty  =  DBy.  (4.3) 

Making  use  of  the  Gauss-Markov  theorem  [14]  together  with  the  recognition  that  tjr  is  a  linear 
combination  of  tjr,  tjris  the  BLUE  of  t]r,  i.e.  i]r  ~  N(  tjr,  o2K),  K=  (DB)  {DB)'. 

Returning  to  (4. 1),  ijr(.  is  a  difference  of  two  contrasts  each  of  them  is  the  column  contrast  of 
two  adjacent  regions  in  a  frame.  Clearly,  ijr  equals  zero  whenever:  i)  there  is  no  edge  at  column  i 

in  the  same  neighborhood  in  the  frames,  or  ii)  if  an  edge  exists  at  column  i  in  the  frames  (static 
object).  Therefore,  when  i|r  covers  the  origin,  it  signifies  no  shifts  in  vertical  edges  (locally).  The 
hypothesis  testing  for  vertical  edge  shifting  can  thus  be  mathematically  expressed  as 

H™:  tjr  h  Dqxp  p^xl  =  0 9x1  ^  ^ 

H[m):  i|r*09Xl  . 

Because  of  the  q- D  composite  contrast  space  as  established  in  the  previous  paragraph,  the 
implementation  of  (4.3)  is  identical  to  (3.6);  it  is  rewritten  here  as 

HW 

ty'K1  tjr  ><  qs2Fa.qn_r.  (4.5) 

The  detection  of  four-directional  edge  movement  is  developed  analogously  to  edge  detection. 


V.  DETECTION  OF  MOVING  OBJECTS 
a)  Generalized  Likelihood  Test 

Consider  in  frame  1  a  dynamic  but  rigid  object  that  encloses  a  maximal  rectangular  mask  of 
dimension  /  *  J.  Suppose  the  mask  is  aligned  with  the  frame  and  the  top  left  corner  is  at  location 
(xp  j'j)  of  the  latter.  Since  the  mask  may  no  longer  be  a  square,  the  diagonal  effects  exhibited  in  a 
GLS  design  have  no  physical  meaning,  (2. 1)  is  modified  to  represent  mask  pixels,  for  m  =  1 ,  as 

Q:  ym = f  +  +  +  ymJ  +  /  =  1>  ->  rJ  =  •> J’  (51) 

where  the  same  terminology  as  before  is  used. 
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Consider  a  second  mask  of  the  same  size  aligned  with  frame  2  in  a  manner  its  top  left  comer 
is  at  (x2,  y2)  in  frame  2 .  Model  (5.1)  is  also  valid  for  the  second  mask  (m  =  2).  Pixel  gray  levels  in 
the  aforementioned  masks  (aligned  at  different  positions  on  two  successive  scenes)  are  described  in 
terms  of  row,  column,  frame  effects,  and  can  be  represented  compactly  by  Ppxl ,  which  is  defined  in 
(2.16).  From  the  parametric  model  (5.1)  Ppxl  cQ,  where  p  =  1  (grand  mean)  + 
2  (frame  effects)  +21  (row  effects)  +  2J  (column  effects).  The  corresponding  effect  matrix  X' 
is  similar  to  (2.1)  with,  however,  no  diagonal  treatment  effects.  Due  to  the  displacement  of  the 
masks,  the  matching  of  them,  i.e.,  effects  in  one  mask  equal  to  that  in  the  other,  implies  the  existence 
of  a  moving  object.  This  results  in  a  sub-parametric  space  fa), 

to  =  Q  n  (C1=C2,  a1;1=a2;1,  ",  avj=a2j>  Y1;1=Y2;1,  '"’YvjrYi-J)  >  0)cQ’  ™th  the  corresponding 
parameter  vector  Pu .  Henceforth,  all  associated  parameters  tantamount  to  the  same  restriction  are 
emphasized  so  through  the  subscript  fa).  Under  the  parametric  space  fa),  the  frame  effects  ,  £2  are 
the  same  and  are  absorbed  into  the  grand  mean  p.  Also,  since  a1;/  =  a2;/,  they  are  denoted  by  a 
common  variable  a  ,  i  =  1,  -,  /;  similar  reasoning  leads  to  the  new  variable  (3 \.,  j  =  1,  •••,  J,  yielding 

a  pw  x  1  parameter  vector  Pu,  P^  =  1  +  /  +  J.  The  corresponding  effect  matrix  X/u  is  changed 

accordingly.  It  then  follows  that  the  detection  of  a  dynamic  object  corresponds  to  being  able  to 
discriminate  whether  p  or  Pu  better  describe  the  data.  The  detection  of  a  moving  object  can  thus 

be  formulated  as 

if„:  j'-X'P  +  e  (52) 

Hv  ^  =  + 

where  e  ~  N(  0,  a2/).  As  explained  before,  X!  is  of  n*p  with  rank  X1  =  r,  r  =p  - 1,  t  =  1+2+2; 
while  X,u  is  of  and  rankJ^  =  ru,  ra  =pu  -  ta,  pu  =  1  +1 +J,  tu  =  2. 

Defining  py|^(y|#;.)  as  the  conditional  probability  density  function  (pdf)  of  a  stochastic 

image  Y2!Jxl  (row  scanning  of  a  3  -D  mask)  with  the  realization  y2!Jxl  under  hypothesis  Ht,  /=0,1 , 


the  generalized  likelihood  statistic  for  (5.2)  is 


Ky)  = 


l#i> 

maxQ  P  Y\H0(y\Ho) 


Hy 

H0 


(5.3) 


and  the  testing  statistic  rejects  H0  if  k(y)  >  X0,  where  the  constant  A0  is  chosen  to  give  the  desired 

significance  level  a  [16], 

Under  H0 ,  the  multivariate  conditional  pdf  is 

pY\HJiy\H0)  =  (2  no2  \'An  exp  {-  ^  S(y,  P )  /  o2  },  (5.4) 

and  SQ(y,  P)  the  sum  of  squares  error 

5(J’,P)  =  0-VP)'(J-VP)  =  |^-X'P|2.  (5.5) 
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Since,  practically  speaking,  P,  a2  are  unknown,  PY\H(.y\H0)  cannot  be  evaluated.  To 

circumvent  this  difficulty,  we  replace  the  unknown  parameters  by  their  MLE's  [16].  Apply  (2.18)  to 
obtain 

S(y,  P)  =y'Q'Qy,  (5.6) 

where  Q  =  I  -  X!  B ,  B  -  (XX1  +  HH1)1  X .  With  the  aid  of  Theorem  2,  the  maximal  pdf  is 

/  .  \  -nil 

max  pY,H  (y  |/70)  =  (2  tz  a2  \Vl”  exp  {  -  V2  S(y,  P)  /  a2  }  =  2n  —  -  exp  {  -  V2  (n  -  r) }  . 

(3  0  n~r  )  (57) 

Similarly,  under  Hl  the  least  squares  error  and  the  corresponding  maximal  pdf  are 

(5.8) 


max  prm(y\H:i)  =1  2it 


%JS) 


exp  \-V2(n-r) 


^P)l 
%J)I ' 


respectively,  where  Q<:  =  I-X,„ =(kK  +  xa  ■  Substituting  (5.7), (5.9)  into 


(5.3)  and  taking  the  natural  logarithm,  the  generalized  likelihood  statistic  is 


V4  in  -  r ) 


S(y,P)-sa(r,P) 

*Cf,P) 


><  K  • 


(5.10) 


Since  rank X^}  =  I  +  J-  l<2I  +  2J-2  =  rank  AT7,  following  the  vector  space  concept  of 
Theorem  2,  it  is  obvious  that  S(y,  P)  utilizing  higher  order  of  basis  vectors  incurs  a  smaller  error  than 
Su(y,  P),  implying  that  A0  is  a  negative  value.  Dividing  both  sides  by  -  V2  q,  we  have 


\(y)  = - - -  ><  A0  , 

%,p)/(«-r)  Hx 


(5.11) 


where  q  =  r  -ru  =  /  +  J  -  1.  Apparently,  A(y)  is  a  sufficient  statistic  that  decides  on  H1 ,  i.e.,  the 
existence  of  moving  object,  whenever  A(y)  <  A0;  otherwise  it  favors  H0,  i.e.,  no  moving  objects  in 
the  3  -D  mask.  In  the  above  the  constant  A0  is  not  necessarily  the  same  from  one  line  to  the  next. 
For  a  given  significance  level  a,  A0  and  a  are  related  by 


3M - (under  H0)  <  A0  =  1  -  a  . 

S(y,$)/(n~r) 

Applying  the  vector  space  concept  of  [12]  once  more,  it  is  not  difficult  to  see  that 

P)  =  (E”=1  1  c&)  (E"-i  aA  -  E^i  cfij) 

=  Ef=l  a?  +  Ti-qyl  (ai  -  cf  +  E"  r+l  af- 

With  the  choice  aj  =c  ,  i  =  q+ 1,  r,  under  HQ,  then 


(5.12) 


(5.13) 
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2  2  2 

Xg  +  Xn-r  —  Xg  +  n  -  r  » 

(5.14) 

(5.15) 

which  simplifies  (5.12)  to 

J  1  &,-,-£,>/<?  <  *  L  o>{^  ,  <  )  =  1  -  a  ,  (5-15) 

X„.rl(n-r ) 

%2  /  q 

where  F  n_r  = - - - is  Fisher's  F-statistic  with  q,  n-r  degrees  of  freedom  (df).  The  threshold 

q’nr  J  ^-r) 

A0  =  Fa.qn_r  can  be  found  using  the  tabulated  F-tables.  The  power  of  detection  is  evaluated  similarly 
to  [17].’ 

b)  Multi-Comparison  Contrast  Test 

Let  us  introduce  a  contrast  vector  6gy'  =  [  0|  •••  6q  ]7,  q  =  I  +  J-  1 ,  and  the  contrast  functions 

Cj-C2,  /  =  1 

0,  =  < ’ll,  i</</;  u  =  i- 1  (516) 

L  -  L  ,  I<i  <I+J;  v  = 

where  { ^l  v }  are  as  defined  in  (3 . 1),  and  { r)j.u }  are  identical  to  { }  upon  replacing  column  effects 
by  row  effects.  Here  0}  is  the  contrast  of  frame  effects;  (0.;  1  </</}  (10,;  I<i<I+J })  measures  the 

contrast  of  the  difference  of  two  adjacent  vertical  (horizontal)  blocks  in  a  mask  to  that  in  another 
mask.  Obviously,  0(  equals  zero  if  the  masks  (located  at  different  positions  in  the  registered  scenes) 

have  the  same  texture.  Since  { 0(. }  is  an  independent  set  of  contrasts,  by  the  arguments  in  Section 
III,IV  hypotheses  testing  (5.2)  can  be  implemented  by  a  multi-comparison  test  of  whether, 
corresponding  to  0  =  /?P,  its  BLUE  estimated  =  RBy  is  within  a  confidence  ellipsoid  or  not. 
Applying  (3.6)  or  (4.5),  test  statistic  for  moving  objects  is 


&L  l  d  ><  qs2F  , 


(5.17) 


where  o2L  =  var  ( d)  =  RB  war  (y  )(RBy.  Noting  that  S(y,  ^)  /  (n-r)  =  s2  and  XQ  =Fa.qn_r,  the 

generalized  likelihood  test  (5.11)  is  equivalent  to  the  multi-comparison  test  (5.17).  Therefore,  the 
detection  of  edges  as  well  as  moving-edges  based  on  the  multi-comparison  contrast  test  yield  the 
optimal  tests.  All  remarks  pertaining  to  edge/moving-edge  detection  are  also  valid  for  moving  object 
detection.  Due  to  its  simplicity  in  implementation  as  well  as  the  physical  meaning  attached  to  the 
contrasts  (similarity  measure)  between  two  objects,  the  multi-comparison  test  is  naturally  preferred 
over  the  generalized  likelihood-ratio  test. 

c)  Practical  Aspects  Consideration 

Although  the  detector  (5.11)  or  (5.17)  is  optimal  theoretically,  it  encounters  some  implementation 
difficulties:  i)  object  displacement  from  ty  to  t2  is  unknown;  ii)  object  size  is  also  unknown.  For 
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convenience,  denote  R,,R2  as  the  top  left  comer  of  an  object’s  maximal  inscribing  rectangle  in  frame 
1,2,  respectively. 

To  circumvent  i),  we  suggest  an  exhaustive  search.  Let  C  =  icv  cv  •••,  cD }  be  a  finite  set  of 
positive  integers  and  c;,  /  =  1,  •••,  D  the  separation  between  Rl  ,R2 .  Introduce  a  Cartesian  co¬ 
ordinate  system  with  D  ,D  as  the  vertical,  horizontal  axes  centered  at  R,  and  it  is  aligned  with  the 
frames.  For  any  cz,  1=1,  D  the  loci  of  R2 ,  of  the  co-ordinates  (Dx,Dy),  are  equi-distant  lattices 
on  the  lines 

DX+Dy  =  ci>  ~Cl<DX±Cl  ,,  1R, 

Dx-Dy=-Cl,  ~c^Dx<Cl,  V-L*> 

which  represent  the  intercepts  in  the  first/second,  third/fourth  quadrants,  respectively.  Suppose  R2 
is  6  pixels  (-  cz )  away  from  Rx  and  the  search  step  a  equals  2  pixels.  Upon  substituting  Dx  =  0  a 

into  (5.18),  then  D  =6,  -6;  similarly,  substituting  Dx  =  1  a  into  (5.18),  Dy  =  4,  -4;  and  so  forth. 

As  a  matter  of  fact,  when  cz  =  6 ,  we  only  have  to  investigate  the  following  points  (relative  to  R1 ): 
(0,6),  (0,  -6),  (2,4),  (2,  -4),  (4,2),  (4,  -2),  (6,0),  (-6,0).  Summing  all  pixels  in  C,  the 
number  of  operations  amounts  to  T.f=x  4  ( [cz/  a]  -  1 },  [x]  denotes  the  integer  part  of  x. 
Recognizing  that  Rx  is  in  the  neighborhood  of  a  shifting  edge,  the  searching  process  is  repeated  for 
Rx ,  leading  to  the  total  number  of  searches  for  a  moving  object 


D  (  \c 
=  i£4  fi 

.  /=i  \  [A. 


(5.19) 


Since  the  time  separation  between  frames  1 , 2  is  presumably  short,  cz  is  of  low  value.  Clearly, 
Nsearch  is  small  as  compared  to  a  brute-force  approach,  which  is  computationally  expensive  because 

it  looks  for  all  sets  of  pixels  in  the  image.  Briefly,  moving  edge  detection  basically  reduces  the  search 
domain,  and  hence  the  computation  time  in  the  detection  of  moving  objects. 

To  bypass  ii)  we  suggest  a  rubber  mask  approach  [18],  Set  the  inscribing  rectangle  to  an 
initial  size,  for  example,  3x3.  Apply  the  exhaustive  search  and,  if  Hx  is  decided,  increment  the  mask 

outward  and  repeat  the  process.  Mask  expansion  is  stopped  whenever  the  former  decision 
switches  to  H0,  implying  that  the  mask  has  grown  too  big  as  to  include  background  pixels.  Since 

background  pixels  (in  the  context  of  the  object)  are  different  from  one  region  to  the  next,  it  is  a 
prudent  practice  not  to  include  them;  otherwise  there  will  be  a  difference  even  for  two  identical 
objects.  The  dimension  of  the  mask  responsible  for  the  current  Hx  decision  is  the  maximal  size. 


d)  Generality  of  the  New  Class  of  Detectors 

(1)  In  (5.2),  each  pixel  is  modeled  as  a  function  of  row  and  column  effects.  Based  on  (5.17)  the 
decision  of  Hx  being  true  assures  the  difference  of  all  pixels  in  one  mask  over  that  in  another,  after 

decorrelation,  is  within  a  confidence  ellipsoid.  This  is  a  direct  consequence  that  under  Hx  and  for 


Xj  *x2 ,  and/or  _yL  *y2 ,  every  pixel  at  a  particular  location  in  one  mask  equals,  within  a  confidence 

interval,  to  the  pixel  at  the  same  site  in  another  mask.  In  this  sense,  correspondence  problem  [4] 
naturally  disappears. 

To  gain  more  insight,  let  us  eliminate  both  the  row,  column  effects  in  (5.1)  and  rename  the 
resulting  equation  as  (5. 1').  Clearly,  pixel  gray  level  is  now  solely  a  function  of  frame  number.  The 
rank  of  the  effect  matrix  in  (5.1')  being  2  is  far  less  than  that  in  (5.1).  By  the  vector  space  concept 
in  Theorem  2,  the  least  squares  error  suffered  by  (5.T)  utilizing  a  lower  order  of  basis  vectors  is 
naturally  worse  than  that  in  (5. 1).  Note  that  (5.1')  is  actually  a  similarity  measure  predominantly  used 
in  the  past  [4], 

(2)  No  displacement  of  masks,  i.e.,  x2=xv  yl=y2- 

i)  Setting  /  =  1,  J=  2,  from  (5.16)  we  readily  obtain  the  contrast  functions  0j  =Ct  -  C2, 

02  =  (Yi  i  "  Yi-2)  ~  (Y2 1  ~  Y2  2)  •  Evidently,  02  is  a  measure  of  the  rate  of  gray  level  change 

at  the  same  location  in  two  sequential  frames  and,  together  with  the  body-rigidity  constraint, 
form  the  optical  flow  concept  and  is  a  popular  technique  for  object  detection  [7],  However, 
as  the  window  size  is  small,  02  performs  poorly  in  noisy  environments  (refer  also  to  Remark 

1  in  Section  III).  This  is  easily  seen  by  rewriting  the  realization  of  02  as 
@2  =  Oi;!!  “  02;ii  and  takin8  the  second  moment  to  get 

var(02)  =  E*=1£*=1>>m.ly  =  4a2. 

ii)  The  proposed  detector  in  the  form  of  (5.11)  or  (5.17)  is  a  stationary  object  detector. 
Suppose  we  further  eliminate  m ,  i.e,  ignoring  frame  effects,  then  the  new  detector  (5.11)  or 
(5.17)  corresponds  to  a  stationary  object  detector  that  does  not  take  advantage  of  the 
available  information  between  frames  (deducible  from  data).  This  is  the  same  as  the  reduction 
of  basis  vectors  by  half  and,  by  the  vector  space  approach  in  Theorem  2,  is  naturally  worse 
than  is  achieved  by  a  detector  making  use  of  frame  effects.  This  is  similar  to  [19]  where, 
because  the  mask  engulfs  the  background,  a  one-to-one  random  mapping  is  required. 

iii)  Suppose  the  third  dimension  m  represents  the  spectral  instead  of  temporal  data,  the 
resulting  detector  (5.1 1)  or  (5. 17)  becomes  the  multispectral  detector.  In  particular,  the  new 
detector  in  independent  noise  that  assumes  object  size  of  1  pixel  and  I  =  J=  1  has  been 
investigated  before  [17], 


VL  SIMULATION  RESULTS 

Fig.2  (al)-(a2)  are  two  consecutive  image  frames  of  Route  49  near  Griffiss  AFB,  Rome,  NY,  taken 
at  Va  second  apart.  Fig.2  (bl)-(b2)  are  the  corresponding  images  contaminated  by  Gaussian  noise 

JV( 0, 10).  To  detect  edges,  the  detector  i[  E[l  ^  E2l  ^  (see  eq.(3.6))  are  applied  in  four  major 

directions,  yielding  the  edges  shown  in  Fig.3  (al),(a2),  respectively.  Concurrently,  the  detector  (4.5) 
(again  four  directions)  results  in  the  moving-edges  field  as  appeared  in  Fig.3  (b).  Finally,  the 
determined  moving  objects,  based  on  detector  (5. 17),  are  marked  by  rectangles,  as  depicted  in  Fig.3 
(cl)-(c2).  The  corresponding  images  when  noise  is  of  N( 0,  50)  are  shown  in  Fig.2(bl)-(b2),  with 
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the  resulting  outputs  in  Fig.4.  In  all  cases,  the  Type-1  error  is  set  to  0.01 . 

The  superior  performance  of  the  proposed  class  of  detectors  speak  for  themselves. 


vn.  CONCLUSIONS 

We  have  derived  a  closed  form  solution  for  the  optimal  detection  of  moving  objects  embedded  in  a 
Gaussian  noise  environment.  Since  the  moving  edge  detectors,  which  in  turn  depend  on  the  edge 
detectors  tantamount  to  the  implementation  of  the  proposed  detector,  they  are  developed 
simultaneously.  The  new  detector  is  a  3-D  filter  utilizing  the  spatial  (intra-frame)  and  temporal  (inter¬ 
frames)  data.  Pixels  in  a  3-D  neighborhood  are  modeled  in  terms  of  various  effects.  This  allows  a 
large  variation  of  gray  levels  (heterogeneity)  as  compared  to  the  traditional  model  that  usually 
assumes  homogeneity  (locally).  By  the  vector  space  approach  in  Theorem  2,  all  well-known 
detectors  based  on  simpler  models  become  subclasses  of  the  proposed  detector.  Since  the  new 
detector  is  a  F-statistic,  it  is  uniformly  most  powerful  (UMP)  in  Gaussian  environment  of  unknown 
and  varying  noise  variance  [20], 

The  equivalence  of  the  generalized  likelihood  test  and  the  multi-comparison  test  (MCT)  is 
then  established.  Since  the  MCT  measures  the  contrast  between  frames  of  a  local  neighborhood  (see 
section  3(d)  i),  it  is  analogous  to  the  optical  flow  technique  without  its  drawbacks,  in  the  combat  of 
noise.  In  this  context,  the  new  detector  is  a  combination  of  the  optical-flow  and  feature-based 
techniques,  but  is  primarily  a  generalized  optical-flow  based  detector  that  takes  advantage  of  the 
feature-based  approach  (such  as  shifting  edge),  for  efficient  computation.  Because  of  the  associated 
confidence  ellipsoid  (see  (5. 17)),  the  proposed  detector  is  efficient  in  the  suppression  of  noise  and 
the  enhancement  of  similarity.  A  less  obvious  advantage  is  that  it  may  tolerate  scene  frames  that  are 
slightly  out  of  registration. 

Simulation  results  based  on  real  noisy  images  demonstrate  the  practicality  and  superiority  of 
the  proposed  detector. 
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Fig- 2 

(al)-(a2):  consecutive  scene  frames  of  a  highway; 

(bl)-(b2),  (cl)-(c2)  are  images  contaminated  by  Gaussian  noise  N( 0,  10),  N( 0,  50),  respectively 
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I  - 


V  . 


(bl) 


(Cl)  (C2) 

Fig.  3 

(al)-(a2):  edges;  (b):  moving-edges;  (cl)-(c2):  moving-objects 
detected  using  appropriate  3-D  detectors  on  the  images  in  Fig. 2  (bl)-(b2),  at  a  =  0.01 
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Fig.  4 

(al)-(a2):  edges;  (b):  moving-edges;  (cl)-(c2):  moving-objects 
detected  using  appropriate  3-D  detectors  on  the  images  in  Fig.2  (cl)-(c2),  at  a  =  0.01 
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ABSTRACT 

These  tests  will  perform  as  a  link  test  between  the  two  stations,  the  CRC, 
Ottawa  and  Rome  Labs,  GAFB,  New  York.  Equipment  configurations  for  the 
ATM  tests  as  well  as  channel  characterization  in  term  of  C/No  and  BER  as  a 
function  of  other  ATM  parameters  in  a  single  channel  and  multiple  channels  are 
studied.  Performance  analysis  are  also  includes  plots  for  the  different  QPSK  and 
8-PSK  modulations  with  or  without  error  correcting  coding. 


7-2 


Table  of  Content 


1 .  Introduction 

2.  Setup  and  Configurations 

2.1  Satellite  Transmission  Equipment 

2.2  ATM  and  Test  Equipment 

3.  Channel  Characterization 

3.1  C/No  Measurements 

3.2  BER  Measurements 

4.  ATM  Transmission 

4.1  ATM  QoS  Parameters 

4.2  Single  Channel  with  Different  Modulations 

4.3  Multiple  Channels  with  Different  Modulations 

5.  Analysis  of  Results 

6.  Conclusion 

7.  References 


Appendix  A:  Figures 


7-3 


TRANSPORT  OF  ATM-BASE  TRAFFIC  VIA  THE  ADVANCED 
COMMUNICATION  TECHNOLOGY  SATELLITE  (ACTS) 

Mostafa  Chinichian 


1.  INTRODUCTION 


The  popularity  of  Asynchronous  Transmission  Mode  (ATM)  protocol  has 
risen  to  a  new  height,  and  all  efforts  are  being  made  to  verify  its  potential  and 
usefulness.  It  is  indicated  in  [1]  that  in  the  next  decade,  most  telecommunication 
traffic  will  be  carried  by  ATM  technology.  The  usefulness  and  adaptability  of 
ATM  to  current  technology  is  now  the  subject  of  lot  of  research  in  both 
Government  and  industry.  Although  ATM  was  originally  intended  for  fiber  optic 
links  transmitting  high  speed  data,  it  can  be  used  for  other  links  such  as  the 
satellite  link,  which  is  the  subject  of  our  experiment. 

This  report  documents  the  experiment  of  transmitting  ATM  signal  via  the 
Advanced  Communication  Technology  Satellite  (ACTS).  Several  tests  were 
performed  via  a  Satellite  DS-3  link  carrying  ATM  traffic  in  a  Physical  Layer 
Convergence  Protocol  (PLCP)  frame  format  for  a  single  channel  as  well  as  for 
mult, pie  channels  of  the  ATM  via  the  ACTS.  These  experiments  are  part  of  the 
Global  Grid  Technical  Technology  Cooperation  Program  (TTCP)  network  in 
which  Rome  Laboratory  (RL)  will  communicate  via  the  satellite  with  the 
Canadians  at  the  Communication  Research  Center  (CRC)  [1],  [2],  These 
experiments  were  conducted  between  the  CRC,  Ottawa  and  Rome  Labs,  GAFB, 
New  York.  This  report  will  be  based  on  the  segment  of  the  ATM/ACTS 
experiments  that  were  conducted  from  27  June  1995  through  24  August  1995. 
The  report  documents  the  specific  tests  that  were  conducted,  presents  the 
collected  data  and  discusses  the  results  obtained. 
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2  .  SETUP  AND  CONFIGURATIONS 


The  first  stage  of  the  experiment  involved  the  characterization  of  the 
satellite  channel  for  reliable  transmission  of  ATM  signals  at  DS-3  rates  via  the 
ACTS.  The  ATM  equipment,  satellite  transmission  equipment  and  the  test 
equipment  used  during  the  experiment  as  well  as  the  characterization  procedure 
are  briefly  described  in  this  section. 

2.1  SATELLITE  TRANSMISSION  EQUIPMENT 

The  satellite  equipment  consist  of  the  EF  Data  modem  (SDM-9000 
Satellite  Modem,  Version  3.1.1),  Ka-band  Travel ing-Wave-Tube  Amplifier 

(TWTA),  Test  Loop  Translator  (TLT),  power  controllers,  and  the  high  data  rate 
antenna. 

The  satellite  transmission  equipment  which  included  the  Up/Down 
converters,  the  TLT  for  ground  loop-back,  Low  Noise  Amplifiers  (LNA),  etc. 
were  located  in  a  hut  next  to  the  antenna,  where  the  power  meter  could  be 
monitored.  The  EF  Data  modem  and  the  rest  of  the  baseband  equipment  (ATM, 
Firebird  tester,  Spectrum  Analyzer,  etc.)  were  installed  in  an  adjacent  hut 
located  about  25  ft  from  the  antenna.  An  Intermediate  Frequency  (IF)  of  70  MHz 
was  used  throughout  the  test. 

The  EF  Data  Satellite  modem  is  a  high  performance,  full-duplex,  digital- 
vector  modulator/demodulator  with  data  rate  capabilities  of  6  Mbps  to  51.84 
Mbps.  The  modem  has  built-in  scrambler/descrambers,  differential 
encoding/decoding,  multi-rate  forward  error  correction  capabilities  (convolutional 
encoder  and  Viterbi  decoder)  and  can  be  configured  to  add  over  head/framing 
to  the  data.  For  example,  a  Reed-Solomon  codec  is  provided  to  work  in 
conjunction  with  the  Viterbi  decoder  and  includes  additional  framing  and 
interleaving  for  improved  performance.  Most  of  the  equipment  used  was 
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provided  by  Canadian  on  corporation  with  Rome  Labs.  Also,  the  frequency  plan 
used  in  the  design  of  the  TLT  is  provided  by  CRC.  It  should  also  be  noted  that 
at  Rome  Labs  the  antenna  is  3.7  m  (with  a  gain  of  50.80  dBi)  while  at  Ottawa, 
Canada,  the  antenna  is  4.60  m  (with  a  gain  of  53.80  dBi).  These  antennas  were 
provided  by  CRC. 

2.2  ATM  AND  TEST  EQUIPMENT 

The  ATM  equipment  used  in  this  experiment  is  the  ADTECH  AX/4000 
Series.  The  AX/4000  Series  ATM  generator/Analyzer  provides  complete 
physical,  ATM  equipment,  switches  and  networks  for  proper  operation  and 
Quality  of  Service  (QoS)  parameter  measurements.  It  is  modulator  in  nature,  so 
it  allows  custom  configuration  for  a  variety  of  ATM  test  applications.  The  main 
components  consist  of  mainframe,  ATM  Generator/Analyzer  modules,  port 
interfaces  and  PC  controller  which  is  driven  with  Microsoft  Window-based 
software.  While  a  variety  of  port  interfaces  are  possible  (e.g.  SONET  OC-3c,  E3, 
TAXI,  DS-3,  etc.),  the  DS-3  interface  supports  45  MHz  using  electrical  format 
with  BNC  connectors  was  used  to  interface  the  ATM  equipment  with  the  modem. 

3.  CHANNEL  CHARACTERIZATION 

The  sequence  of  tests  involving  the  transmission  of  ATM  cells  via  the 
ACTS  satellite  started  on  June  27,  1995.  The  first  set  of  tests  were  aimed  at 
characterizing  the  channel  and  modem  and  also  to  familiarize  Rome  Lab 
personnel  with  the  ground  terminal  equipment.  To  evaluate  the  EF  Data  modem 
(SDM-9000  Satellite  Modem),  and  the  Firebird  DS-3  test  equipment,  a  number 
of  transmissions  involving  both  CW  testing  and  some  modulations  were  made. 
The  channel  characterization  and  modem  evaluation  are  necessary  to  ensure 
that  at  DS-3  speeds  and  in  the  Ka-band,  the  modem  specification  for  the 
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different  modulations  can  be  verified.  Also,  through  carrier-to-noise  ratio 
measurement  the  power  transfer  characterizing  of  the  ACTS  transponder  can  be 
ascertained  in  order  to  determine  appropriate  operating  points  for  the  ATM 
experiments. 

The  tests  includes  obtaining  the  modem  BER  characteristics  as  a 
function  of  the  Output  Power  Back  Off  (OPBO)  of  the  satellite  in  terms  of 
received  Eb/No.  For  each  test,  the  measurements  were  taken  for  three  modes: 

1.  Full-duplex  mode  (CRC-RL  and  RL-CRC,  both  via  ACTS). 

2.  Self  Loop-back  mode  (via-  ACTS) 

3.  Ground  Loop-back  (TLT) 

The  ACTS  operations  center  can  provide  satellite  loop-backs  for  both 
the  East  Scan  4E  spot  beam  and  the  steerable  spot-beam.  It  should  be  pointed 
out,  however,  that  during  such  loop-back  there  may  be  considerable  loss  in  the 
orthogonal  polarization  and  satellite  coverage. 

3.1  C/No  MEASUREMENTS 

During  the  CW  transmissions,  the  transmit  power  levels  at  both  ends 
(CRC  and  RL)  were  varied  (with  the  Tx  power  meter  reading  recorded)  while  the 
carrier  power  (C)  and  the  noise  power  (No)  read  from  the  spectrum  analyzer  at 
opposite  ends.  In  this  case,  the  spectrum  analyzer  received  power  is  calculated 
from 


Eb/No  =  C/No  -  10log(44.73x106 ) 
=  C/No  -76.5  dB 

where  C  and  No  are  read  from  the  spectrum  analyzer. 
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3.2  BER  MEASUREMENTS 


To  evaluate  the  modem  and  the  Firebird  BER  tester,  the  following 
modulations  were  transmitted: 


1. 

QPSK 

R  =  3/4, 

no  RS 

2. 

QPSK 

R  =  3/4, 

with  RS 

3. 

8-QPSK 

R  =  2/3, 

no  RS 

4. 

8-QPSK 

R  =  2/3, 

with  RS 

In  each  case,  both  the  received  BER  and  Eb/No  were  measured  for  each  setting 
of  the  transmit  power.  All  transmissions  included  are  convolutional  encoding  and 
corresponding  Viterbi  decoding  with  indicated  rates  (R  =  3/4  for  QPSK  and  R  = 
2/3  for  8-QPSK).  Also,  In  our  experience  convolutional  codes  with  different  rates 
(2/3  and  3/4)  are  modulated  at  IF  70  MHz  with  or  without  being  concatenated  to 
a  (208,  192)  Reed-Solomon  code. 

Also,  for  each  power  level  considered  the  BER  was  taken  from  the 
Firebird  tester  as  well  as  from  the  modem  while  the  reading  of  Eb/No  were  made 
from  the  modem. 


4.  ATM  TRANSMISSION 

The  main  aim  of  the  ATM  experiments  is  to  characterize  the  DS-3 
Satellite  link  for  the  transmission  of  ATM  signals.  Specifically,  some  ATM  QoS 
parameters  were  measured  using  different  levels  of  modulation  (QPSK,  8-QPSK) 
with  forward  error  correction  and  V.35  data  scrambling.  The  bursty  nature  of  the 


error  statistics  and  the  corresponding  large  propagation  delay  involved  in  a 
Satellite  link  affect  these  ATM  QoS  parameters.  Also,  at  the  Ka-band  (30/20 
GHz)  of  the  ACTS  the  Satellite  link  is  susceptible  to  atmospheric  effects,  the 
most  important  of  which  is  attenuation  due  to  rain.  To  properly  assess  the 
viability  of  ATM  via  Satellite,  the  cumulative  effect  of  these  Satellite  link 
characteristics  on  the  QoS  parameters  must  be  determined. 

4.1  ATM  QoS  PARAMETERS 

A  number  of  ATM  parameters  have  been  identified  as  being  very 
important  in  the  assessment  of  the  performance  of  an  ATM  network.  In  the 
experiments,  emphasis  was  placed  on  those  quality  of  service  parameters  and 
statistics  that  can  be  measured  by  the  ADTECH  ATM  test  equipment.  Some  of 
these  parameter  will  be  discussed  in  this  section. 

1 .  Bit  Error  Ratio  (BER)  -  this  is  the  Satellite  channel  bit  error  rate 
that  determines  the  rate  at  which  the  transmitted  bits  were  changed  in  the 
physical  layer. 

2.  Cell  Loss  Ratio  (CLR)  -  this  is  the  ratio  of  the  number  of  lost 
ATM  cells  sent  by  a  user  in  specified  time  interval.  Due  to  the  random  nature  of 
the  ATM  traffic  and  the  limited  size  of  the  ATM  traffic  and  the  limited  size  of  the 
buffers,  it  is  usually  possible  that  a  cell  arriving  at  a  switching  node  may  be  lost . 
Thus  CLR  are  caused  by  buffer  overflows  and  bit  error  in  the  cell  header  that 
can  be  detected  but  not  be  corrected. 

3.  Cell  Misinsertion  Ratio  (CMR)  -  this  is  defined  as  the  ratio  of  the 
cells  delivered  to  a  wrong  destination  to  the  total  number  of  cells  sent.  It  occurs 


7-9 


as  a  result  of  an  undetected  error  in  the  header  that  causes  a  change  of  the  cell 
destination. 

4.  Receive  Frame  Alarm  -  the  following  alarm  parameters  related  to 
the  receive  frame  were  measured  : 

Line  Code  Count  and  Rate 
Framing  Count  and  Rate 
P-Bit  Count  and  Rate 
FEBE  Count  and  Rate 

5.  Receive  PLCP  Alarm  -  the  following  PLCP  alarm  parameters 
were  measure  : 


Framing  Count  and  Rate 
Bl  Count  and  Rate 
FEBE  Count  and  Rate 

6.  Other  QoS  parameters  -  In  addition  to  the  above,  the  following 
QoS  parameters  were  also  measured  during  each  run  of  the  ATM  tests: 

Out  of  Sequence  Count 

PRBS  Bit  Error  Count 

PRBS  Bit  Error  Rate 

PRBS  Synchronization  Error  Count 

4.2  SINGLE  CHANNEL  WITH  DIFFERENT  MODULATIONS 


These  tests  were  performed  over  the  ACTS  Satellite  link  test  between  the 
two  stations  (CRC  at  Canada  and  RL  at  GAFB)  using  a  single  payload  data 


stream  which  fully  loaded  the  DS-3  bearer  (i.e.  96000  cells/sec)  with  an 
ADTECH  AX/4000  test-set.  All  tests  performed  with  QPSK  and  8-PSK 
modulations  (  with  convolutional  encoders  and  Viterbi  decoders)  concatenated 
with  or  without  Reed-Solomon  codes.  Also,  we  used  appropriate  cell  rate  to 
increase  observation  times. 

4.3  MULTIPLE  CHANNELS  WITH  DIFFERENT  MODULATIONS 

The  same  tests  above  as  were  performed  using  multiple  channels  data 
streams  which  together  fully  loaded  the  DS-3  bearer.  We  have  recorded  all  the 
data  in  order  to  characterize  the  DS-3  channel  for  transmission  of  ATM  signals, 
in  terms  of  QoS  parameters. 

5.  ANALYSIS  OF  RESULTS 

The  following  pictures  show  result  of  experiments,  BER  versus  Eb/No 
and  other  QoS  parameters  (i.e.  QoS/Single  Stream  and  QoS/Multiple  Streams). 
They  have  been  plotted  in  order  to  enable  reader  to  compare  BER  versus  Eb/No 
and  other  conditions  affect  ACTS  satellite  performance  parameters  (as  an 
example  few  selected  graphs  were  chosen  to  demonstrate  QoS  parameters  and 
Eb/No  in  terms  of  BER,  see  pages  7-13  to  7-20).  For  more  information  refer  to 
Final  Report  for  Summer  Research  Program,  September  1995,  C3/BA,  Rome 
Labs,  GAFB,  NY. 
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6.  CONCLUSION 


Tests  have  been  conducted  between  the  two  stations  (the  CRC,  Ottawa 
and  the  Rome  Labs,  GAFB,  New  York)  to  characterize  channel  and  ATM 
usability  and  performances.  Also,  equipment  configurations  for  the  ATM  tests  as 
well  as  channel  characterization  in  term  of  Eb/No  and  BER  as  a  function  of  other 
ATM  parameters  in  a  single  channel  and  multiple  channels  have  been  depicted 
for  different  conditions.  Performance  analysis  have  been  also  included  plots  for 
different  QPSK  and  8-PSK  modulations  (with  convolutional  encoders  and  Viterbi 
decoders)  concatenated  with  or  without  Reed-Solomon  coding. 
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Probing  Vin(PH)4  in  InP  with  Electron  Paramagnetic  Resonance 

Dennis  P.  Clougherty 

Department  of  Physics  University  of  Vermont  Burlington,  VT  05405 

Abstract 

Electron  paramagnetic  resonance  (EPR)  is  proposed  as  a  technique  to  search 
for  the  presence  of  the  hydrogen  complex  Vin(PH)4,  conjectured  to  be  a  donor 
impurity  in  InP.  A  vacancy  model  is  proposed  to  describe  the  electronic  struc¬ 
ture  of  Vin(PH)4.  In  the  absence  of  a  pseudo  Jahn- Teller  effect,  an  isotropic 
g  value  near  the  free  electron  value  is  predicted.  The  hyperfine  interaction 
arising  from  the  Fermi  contact  term  on  the  E  sites  can  be  modeled  by  an  ef¬ 
fective  spin  Hamiltonian.  The  EPR  signature  of  the  presence  of  the  Vin(PH)4 
complex  is  five  transitions  in  the  first-order  hyperfine  structure  which  are 
approximately  isotropic.  The  effect  of  isotopic  substitution  (H  — *  D)  is  dis¬ 
cussed. 
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Probing  V^PH^  in  InP  with  Electron  Paramagnetic  Resonance 

Dennis  P.  Clougherty 


I.  INTRODUCTION 

The  abundance  and  small  size  of  hydrogen  insures  its  presence  in  semiconductors.  Hy¬ 
drogen  can  appear  interstitially  in  atomic  form,  or  it  can  form  a  variety  of  complexes  either 
with  dopants  or  with  intrinsic  defects.  The  study  of  hydrogen  and  its  associated  complexes 
in  semiconductors  is  of  importance  as  it  is  known  that  its  presence  can  have  profound  effects 
on  subsequent  device  performance  and  reliability. 

Electronic  properties  of  hydrogen  complexes  depend  in  a  sensitive  way  on  their  structure 
in  the  semiconductor.  Infrared  (IR)  spectroscopy  has  proven  to  be  a  valuable  tool,  providing 
in  some  cases,  clues  to  the  structure  [1,2].  Experimentally- measured  vibrational  modes  can 
be  assigned  to  bonds.  Vibrational  multiplets  provide  information  concerning  the  symmetry 
of  the  local  environment.  Isotopic  substitution  of  deuterium  (D)  gives  rise  to  frequency  shifts 
and  symmetry-related  splittings,  providing  further  information  used  in  the  identification  of 
the  vibrational  modes.  While  IR  spectroscopy  provides  some  insight  into  structure,  it  does 
not  unambiguously  determine  structure.  Consequently,  complementary  techniques  will  be 
required  to  remove  the  ambiguity. 

Recently,  it  has  been  proposed  by  Bliss  et  al.  [2]  that  the  presence  of  a  hydrogen  complex 
in  InP,  consisting  of  an  In  vacancy  and  four  tetrahedrally  coordinated  H  atoms,  Vin(PH)4, 
can  reconcile  differences  commonly  found  between  impurity  concentration  measurements 
and  free  carrier  concentration  in  high  purity  samples.  Furthermore,  their  IR  studies  with 
deuterated  InP  find  a  vibrational  triplet,  consistent  with  a  three- fold  (t2)  vibrational  multi- 
plet  being  split  with  partial  D  substitution  for  H  atoms. 
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The  work  summarized  here  is  a  theoretical  investigation  of  the  possibility  of  using  electron 
paramagnetic  resonance  (EPR)  techniques  to  confirm  the  existence  of  the  Vin(PH)4  complex 
in  InP.  It  should  be  noted  that  EPR  has  been  used  successfully  on  InP  to  probe  Fe  impurities 
[3].  While  the  hydrogen  complex  of  interest  was  not  detected  in  these  experiments,  one  would 
not  expect  to  see  a  signal  associated  with  Vin(PH)4  as  the  operating  temperature  was  near 
room  temperature  where  the  probability  of  occupancy  of  the  defect  state  is  vanishingly 
small. 


II.  ELECTRONIC  STRUCTURE  OF  ViN(PH)4 

The  vacancy  model  for  the  electronic  structure  of  Vin(PH)4  is  proposed  below.  It  is 
assumed  that  this  complex  is  isolated  in  bulk  InP.  The  complex  is  formed  by  creating  first 
an  indium  vacancy.  This  gives  rise  to  four  vacancy  states  formed  from  the  dangling  bonds 
contributed  by  the  four  P  atoms  surrounding  the  vacancy.  Given  the  tetrahedral  symmetry  of 
the  vacancy  environment,  these  dangling  bonds  can  be  combined  to  form  a4  and  t2  orbitals, 
with  the  a4  orbital  located  inside  the  valence  band  and  the  triply  degenerate  t2  orbitals 
located  in  the  band  gap,  as  illustrated  in  Fig.  1. 

A  cluster  of  H4  arranged  on  the  vertices  of  tetrahedron  has  molecular  orbitals  of  ai  and 
t2  symmetry,  both  below  the  top  of  the  valence  band.  We  can  approximate  the  H4  orbitals 
as  a  linear  combination  of  atomic  orbitals  (LCAO).  Labeling  the  four  H  atoms  as  indicated 
in  Fig.  2,  the  a4  orbital  of  the  cluster  is  given  by 

lai)  =  b  00°)  +  I0&)  +  1 0c)  +  |0<f))  (1) 

where  |  fc)  is  an  atomic  s  orbital  centered  on  site  i.  The  t2  states  of  the  cluster  are  given  by 

IO  =  i(l0a)-|06)-|0C)  +  |0«f)) 

\v)  =  \  (10a)  +  106)  -  1 0c)  -  10(f)) 

10  =  §  (10a)  -  I0i)  +  I0e)  -  |0«f)) 
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(2) 

(3) 

(4) 


where  |£),  (77),  and  [()  are  orbitals  which  transform  as  yz,  zx ,  and  xy  under  the  operations 
of  the  group  T^. 

When  introduced  into  the  vacancy,  we  expect  the  complex  to  mix  with  the  vacancy 
orbitals,  forming  the  following  states:  (1)  ai  state  whose  character  is  bonding  between  the 
H4  cluster  and  the  four  P  neighbors  and  whose  energy  will  be  deep  in  the  valence  band;  (2) 
a  bonding  t2  manifold  below  the  valence  band  edge;  (3)  an  a^  state  which  is  H-H  bonding 
and  P-H  antibonding  and  is  located  in  the  band  gap;  and  (4)  a  tj  manifold  which  is  P-H 
antibonding.  This  electronic  structure  is  analogous  to  that  of  the  Si:Pt“  defect  considered 
by  Anderson  et  al.  [5]. 

At  suitably  low  temperature,  we  expect  the  purely  bonding  states  to  be  completely  filled, 
while  the  aj  state  contains  a  single  unpaired  electron;  the  t2  manifold  will  be  unoccupied. 
Thus  the  ground  state  configuration  of  the  complex  is  2Si/2.  The  single  electron  bound  to 
the  complex  should  have  a  non- vanishing  EPR  signal,  in  analogy  with  that  of  the  F  center 
in  alkali  halides  [6]. 


III.  EPR  SPECTRUM 

A.  The  Effective  Hamiltonian 

For  a  doublet  ground  state  in  tetrahedral  symmetry,  an  isotropic  g  factor  results.  Thus, 
the  Zeeman  interaction  is  modeled  by  an  effective  spin  Hamiltonian  of  the  form 

T-Cz  =  gfiBS  ■  B  (5) 

where  S  =  |.  It  is  anticipated  that  g  «  ge  =  2.0023,  as  the  ground  state  doublet  will  not 
mix  appreciably  via  the  weak  spin-orbit  interaction  with  the  excited  sextet.  Thus,  EPR 
should  indicate  a  single  transition  in  the  fine  structure,  Ms  =  |  —  |. 

The  hyperfine  interaction  of  the  doublet  with  the  nuclear  spins  of  the  H  atoms  introduces 
additional  transitions.  Motivated  by  Eq.  1,  the  hyperfine  interaction  can  be  viewed  as  a 
superposition  of  H  nuclei-  each  nucleus  i  with  |  -  interacting  with  the  atomic  Is  states 
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(6) 


Hhi  =  AS-YJIi 

i 

Corrections  to  the  LCAO  form  of  the  a^  wavefunction  will  result  in  hyperfine  anisotropy. 

Within  the  LCAO  approximation,  it  is  anticipated  that  the  first-order  hyperfine  spectrum 
has  5  equally  spaced  transitions,  corresponding  to  the  possible  values  of  Iz  which  can  be 
obtained  by  summing  4  spin-1/2  nuclei.  Enumerating  the  number  of  ways  of  getting  the 
various  values  of  Iz  gives  the  relative  intensity  of  these  transitions  in  the  ratio  of  1:4:6:4:1. 
The  first-order  hyperfine  spectrum  is  given  in  Fig.  3. 

The  next  nearest  shell  of  nuclei  correspond  to  the  phosphorus  atoms  adjacent  to  the 
H  atoms  of  the  cluster.  The  nucleus  of  phosphorus  is  spin- 1/2.  Inclusion  of  the  hyperfine 
interaction  with  the  phosphorus  shell  is  more  involved;  since  the  a^  orbital  contains  sub¬ 
stantial  phosphorus  p-character,  the  hyperfine  coupling  to  phosphorus  is  axially  anisotropic. 
It  is  anticipated,  however,  that  the  magnitude  of  the  elements  of  the  hj^perfine  coupling 
tensor  will  be  reduced  from  that  of  the  hydrogen  hyperfine  coupling,  as  the  amplitude  of  the 
wavefunction  on  the  phosphorus  nuclei  will  be  reduced. 

For  dilute  isotopic  substitution  of  H  — >  D,  it  is  expected  that  the  number  of  spectral 
lines  will  be  increased  to  seven.  The  three  H  atoms  will  contribute  four  lines,  corresponding 
to  the  four  possible  Iz  values  from  I  =  |,  while  the  D  atom  will  contribute  three  additional 
lines  as  D  has  a  nuclear  spin  of  1.  The  intensity  ratios  are  1:1:3:1:3:1:1.  While  for  complete 
isotopic  substitution,  nine  lines  are  expected  with  intensity  ratios  of  1:4:10:16:19:16:10:4:1. 

B.  Estimation  of  Parameters 

The  energy  difference  between  the  orbital  singlet  and  triplet  states  is  an  important  pa¬ 
rameter  in  the  analysis.  We  can  approximate  this  difference  by  sphericalizing  the  tetrahedral 
arrangement  of  H  atoms:  we  envision  the  nuclear  charge  ( Z  —  4)  as  being  smeared  out  into 
a  spherical  shell.  This  technique  of  sphericalizing  high  symmetry  structures  has  recently 
been  used  successfully  on  other  clusters  [4]. 
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The  process  of  approximating  the  nuclear  potential  by  its  angular  average  is  tantamount 
to  truncating  the  expansion  of  the  potential  in  multipole  moments  after  £  =  0.  For  a  cluster 
with  Ta  symmetry  and  one-electron  states  which  transform  as  ai  and  t2,  it  is  not  necessary 
to  expand  the  nuclear  potential  beyond  the  £  =  0  moment,  as  the  multipole  moments  £  = 
1,  2  vanish. 

The  radius  of  the  sphere  is  estimated  by  using  known  bond  lengths  for  the  In-P  bond 
(d/n_p  ~  2.54  A)  and  the  P-H  bond  (dp-u  ~  1.60  A).  The  radius  of  the  sphere  for  the  H4 
cluster  in  the  In  vacancy  is  approximately  R  «  d/n_p  —  dp_#  «  0.94  A. 

Consider  first  the  eigenenergies  of  the  H4+  within  the  spherical  approximation.  The  lone 
electron  experiences  a  cut-off  Coulomb  potential  given  by 


= 


Zej R  0  <  r  <  R 


(7) 


Ze/r  r  >  R 

The  eigenenergy  of  the  Is  state  for  this  potential  is  Eb  «  —2.96  eV,  roughly  one-fifth  as 
binding  as  the  Is  energy  for  H. 

Once  this  H4  cluster  is  introduced  into  the  In  vacancy,  the  surrounding  semiconductor 
screens  the  potential,  and  the  binding  energy  will  be  further  reduced,  giving 


E;  - 


Eb 


6  "  e2 


(8) 


where  e,  the  dielectric  constant  of  InP,  is  roughly  10.  This  yields  an  estimate  of  the  binding 
energy  of  E%  ~  30  meV.  Thus,  any  EPR  signal  for  the  complex  will  only  appear  at  temper¬ 
atures  well  below  room  temperature,  where  there  is  substantial  occupancy  of  the  impurity 
state. 

We  estimate  the  isotropic  H  hyperfine  parameter  by  considering  the  magnitude  of  the 
Fermi  contact  term  [7]  for  the  hydrogen  Is  state,  Ais 


2  4 

mec  a 


1420  MHz 
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(9) 

(10) 


As  the  H4  cluster  has  its  lone  electron  distributed  among  the  four  H  atoms,  we  estimate 
that  A  ~  Als/ 4  ps  355  MHz.  This  should  roughly  be  the  spacing  of  the  hyperfine  spectral 
lines. 


IV.  DISCUSSION 

The  analysis  summarized  does  not  include  several  interesting  effects.  The  fine  struc¬ 
ture  was  found  to  be  isotropic;  however,  pseudo  Jahn- Teller  (PJT)  distortions  arising  from 
vibronic  coupling  of  the  a4  and  t*2  states  may  introduce  fine  structure  anisotropy  and  sub¬ 
stantial  shifts  in  the  g- value. 

As  noted  previously,  the  hyperfine  coupling  to  the  phosphorus  nuclear  spins  was  not 
included.  This  will  certainly  add  transitions,  and  it  may  well  wash  out  some  of  the  struc¬ 
ture  resulting  from  the  H  hyperfine  transitions;  however,  this  kind  of  inhomogeneous  line 
broadening  can  be  circumvented  experimentally  with  the  use  of  ENDOR  [8],  rendering  all 
the  hyperfine  transitions  visible. 

Future  work  would  include  a  study  of  the  linewidths,  together  with  the  PJT  analysis.  In 
that  regard,  it  should  be  noted  that  as  the  experiments  are  performed,  additional  theoretical 
analysis  will  certainly  be  required  in  refining  the  theoretical  model. 
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FIG.  1.  One-electron  states  of  (a)  the  In  vacancy,  (b)  the  H4  cluster,  and  (c)  the  Vin(PH)4 


impurity  in  InP. 
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FIG.  2.  Geometry  of  the  Vin(PH)4  impurity  in  InP. 
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(a) 


(b) 


(c) 

FIG.  3.  EPR  spectrum  (intensity  vs.  frequency)  of  (a)  Vin(PH)4  impurity  containing  first-order 
hyperfine  structure;  (b)  Vin(PH)3(PD)  impurity  resulting  from  dilute  H  — *  D  substitution;  (c) 
Vjn(P D)4  impurity  resulting  from  complete  H  — »  D  substitution.  Relative  intensities  within  mul- 
tiplet  are  approximately  to  scale,  conveying  multiplet  shape. 


8-12 


A  STUDY  OF  SIMPLE  AND  EFFICIENT  TECHNIQUES  FOR  LOSSLESS  AND 
NEAR-LOSSLESS  COMPRESSION  OF  DIGITIZED  IMAGES 


Manohar  K.  Das 
Associate  Professor 

Department  of  Electrical  and  Systems  Engineering 


Oakland  University 
Rochester,  MI  48309-4401 


Final  Report  for: 

Sommer  Faculty  Research  Program 
Rome  Laboratory 


Sponsored  by: 

Air  Force  Office  of  Scientific  Research 
Bolling  Air  Force  Base,  Washington  DC 

and 

Rome  Laboratory 


August  1995 
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Abstract 

This  report  introduces  two  simple  techniques  for  lossless  and  near-lossless  compression  of  digitized  images  and 
presents  a  comparative  study  of  the  proposed  schemes  and  some  of  the  existing  ones.  The  proposed  new  techniques 
represent  adaptive  and  hierarchical  versions  of  the  conventional  fixed  Differential  Pulse  Code  Modulation  (DPCM) 
scheme.  These  are  called  Suboptimal  Adaptive  DPCM  (SADPCM)  and  Hierarchical  Block-Adaptive  DPCM 
(HBADPCM)  methods,  respectively. 

The  SADPCM  lossless  coders  are  mainly  useful  for  archival  and/or  transmission  of  still  image  frames  without 
loss  of  any  data,  whereas  their  near-lossless  versions  are  suitable  for  applications  where  loss  of  some  visually 
imperceptible  details  may  be  tolerable  without  compromising  safety.  On  the  other  hand,  HBADPCM  coders  provide 
efficient  tools  for  hierarchical  transmission,  which  allows  either  accessing  the  image  at  progressively  improving  quality 
and/or  resolution  levels,  or  accessing  it  gradually  according  to  preferred  areas  of  interest.  Such  coders  are  intuitively 
appealing  because  they  can  achieve  significant  savings  in  the  overall  transmission  cost  by  allowing  fast  browsing  of  an 
image  database  by  reducing  the  volume  of  unwanted  transmission. 

The  performances  of  the  proposed  coders  are  compared  with  two  of  the  most  popular  existing  schemes  known 
as  the  fixed  DPCM  and  Hierarchical  Interpolation  (HINT)  methods.  The  results  of  the  experimental  studies  indicate  that 
both  SADPCM  and  HBADPCM  coders  perform  better  than  the  other  methods  considered.  Since  the  computational 
complexities  of  the  proposed  coders  are  reasonably  low,  it  is  believed  that  these  should  find  widespread  use  in 
applications  involving  lossless  and  near-lossless,  or  hierarchical  image  compression. 
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A  STUDY  OF  SIMPLE  AND  EFFICIENT  TECHNIQUES  FOR  LOSSLESS  AND 
NEAR-LOSSLESS  COMPRESSION  OF  DIGITIZED  IMAGES 


Manohar  Das 


1.  INTRODUCTION 

The  sheer  explosion  of  data  resulting  from  the  digitization  process  poses  a  formidable  problem  for  widespread 
use  of  digital  techniques  for  transmission  and/or  archival  of  raw  images.  For  instance,  a  1024x1024  image,  quantized  to 
8  bits  per  pixel,  requires  storage  and/or  transmission  of  approximately  1 .05  megabytes  of  picture  data.  The  lossless 
transmission  of  such  an  image  over  a  19.2  kilobits/second  channel  requires  approximately  7.3  minutes,  which  is  rather 
long.  One  of  the  most  promising  solution  to  the  data  explosion  problem  is  afforded  by  image  coding  techniques,  which 
has  been  a  popular  research  topic  for  the  past  two  decades.  As  a  result  of  the  concerted  efforts  of  several  researchers,  a 
multitude  of  different  image  coding  techniques  have  emerged  [l]-[4],  most  of  which  can  be  broadly  categorized  into 
three  classes;  namely,  lossy,  lossless,  and  near-lossless.  A  lossless  scheme  typically  achieves  a  compression  ratio  of  the 
order  of  two  only,  but  will  allow  exact  recovery  of  the  original  image  from  the  compressed  version;  a  lossy  scheme  will 
not  allow  exact  recovery,  but  can  attain  much  higher  compression  ratios,  e.g.,  fifty  or  more.  Finally,  a  near-lossless 
scheme  can  deliver  higher  compression  compared  to  the  lossless  ones,  and  at  the  same,  the  reconstruction  error  for  each 
pixel  is  guaranteed  to  lie  within  some  predefined  limits,  such  as,  ±1,  ±2,  etc. 

Most  of  the  recent  advances  in  image  coding  have  taken  place  in  the  area  of  lossy  compression.  The  currently 
popular  methods  include  transform  coding  [2], [3], [5],  vector  quantization  [6], [7],  multi-resolution  coding  [8]-[10], 
subband  coding  [11],  and  adaptive  predictive  coding  (APC)  [1],[2],[5],[12].  From  the  viewpoint  of  algorithmic 
complexity,  the  APC  techniques  are  simpler,  but  they  can  only  achieve  moderately  good  compression  ratio.  Thus,  for 
low  bit  rate  lossy  coding  applications,  the  other  techniques  are  preferable. 

Although  lossy  compression  schemes  offer  best  compression,  they  are  deemed  to  be  unsuitable  for  medical, 
analytical,  and  archival  applications,  because  uncontrolled  loss  of  any  information  is  not  tolerable  in  these  situations. 
Therefore,  only  lossless  or  near-lossless  schemes  are  suitable  for  these  applications. 

As  compared  to  lossy  compression,  the  lossless  case  presents  a  quite  different  scenario.  Although  theoretically, 
any  lossy  technique,  e.g.,  transform  coding  or  subband  coding,  can  be  used  for  lossless  transmission  by  also  transmitting 
the  quantization  errors,  as  Roos  et  al  [13]  point  out,  this  results  in  a  rather  small  compression  ratio.  Thus,  only  a  handful 
of  different  lossless  coding  techniques  and  their  variants  have  appeared  in  the  literature  till  this  date.  All  of  them  use 
either  a  predictive  model  [13]-[17],  or  a  multi-resolution  image  model  [13],[14],[18]-[20]  to  reduce  the  statistical 
redundancy,  and  then  encode  the  residuals  using  an  optimal  encoder,  e.g.,  Huffman  [21],  adaptive  Huffman  [22], 
Lempel-Ziv  [23],  or  arithmetic  coding  [24].  For  images  digitized  to  256  gray  levels,  these  techniques  typically  achieve  a 
lossless  compression  ratio  of  1.5  to  2.5. 

There  are  certain  applications  involving  image  analysis,  where  loss  of  some  quality  may  be  tolerable  without 
compromising  safety.  Examples  range  from  relatively  less  quality-constrained  applications,  such  as,  preliminary  scene 
analysis,  to  relatively  high  quality-critical  applications,  where  some  loss  in  individual  pixel  values  is  tolerable  only  if  it  is 
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guaranteed  to  be  within  some  quantifiable  limits  set  by  the  user.  The  latter  applications,  in  particular,  need  more  than  just 
high  quality  lossy  compression,  because  commonly  used  lossy  coders  fall  short  of  providing  any  guarantee  on  the  loss 
incurred  at  each  and  every  pixel;  in  fact,  they  can  only  deliver  a  reconstructed  picture  conforming  to  a  desired  overall 
subjective  quality  level.  Near-lossless  coders  are  particularly  well  suited  for  the  above  applications,  because  they  offer 
significant  savings  in  storage  or  transmission  cost  by  delivering  higher  compression  compared  to  the  lossless  ones,  and 
at  the  same  time,  the  reconstruction  errors  for  individual  pixels  are  guaranteed  to  lie  within  some  predefined  limits  set  by 
the  user.  Typically,  such  a  technique  consists  of  a  combination  of  a  predictive  or  interpolative  lossless  coder  and  a 
uniform  scalar  quantizer  [25], [26].  The  compression  gain  attainable  over  a  lossless  coder  depends  on  the  maximum 
allowable  reconstruction  error,  n.  It  can  be  shown  [26]  that  for  small  values  of  n,  a  near-lossless  coder  achieves  a 
compression  gain  of  approximately  log2(2n+l)  bits  less  than  the  underlying  lossless  coder.  For  instance,  if  n  =  1,  this  is  a 
savings  of  log23  =  1.58  bits  per  pixel,  and  if  n  =  2,  this  is  a  savings  of  approximately  2.3  bits/pixel. 

In  yet  other  situations,  another  kind  of  image  compression  methods,  known  as  progressive  coders,  is  required. 
In  progressive  coding,  first  a  coarse  reproduction  of  the  image  is  constructed  using  a  small  fraction  of  the  pixels,  and  this 
coarse  reproduction  is  gradually  updated  as  the  remaining  pixels  arrive.  Progressive  coding  is  particularly  useful  when 
browsing  a  large  image  data  base,  because  in  this  case  the  user  can  quickly  determine  from  the  coarse  version  of  the 
image  whether  the  image  is  an  appropriate  one.  If  not,  the  user  can  realize  significant  cost  savings  by  stopping  the 
transmission  of  the  finer  versions  of  the  image. 

This  study  encompasses  development  of  software  codes  for  some  existing  lossless  and  near-lossless  image 
coders,  evaluation  of  their  performance,  and  investigation  of  some  improved  coding  schemes.  The  specific  objectives 
are  summarized  below. 

2.  OBJECTIVES  OF  THIS  STUDY 

The  three  major  objectives  of  this  study  are: 

i.  Development  of  "C"  codes  for  some  lossless  and  near-lossless  coders  for  incorporation  into  IE2000 
Toolkits 

This  involves  selection  of  some  existing  or  new  algorithms,  and  coding  them  in  "C"  for  incorporation 
into  EE2000  Toolkits. 

ii.  Development  of  new,  improved  lossless/near-lossless  image  coders 

This  involves  investigation  of  new  ideas  for  improvement  of  the  coding  performance  of  the  existing 
algorithms.  Both  full-frame  and  progressive  coding  schemes  are  targeted  for  investigation. 

iii.  Comparative  performance  evaluation  of  the  existing  and  new  coding  schemes 

This  involves  testing  the  performance  of  the  existing  and  new  coders  on  a  variety  of  images  including 
standard  test  pictures,  medical  images,  and  IE2000  pictures. 

The  study  of  the  above  objectives  is  the  focus  of  the  following  sections. 

3.  SOFTWARE  CODES  DEVELOPED  FOR  INCORPORATION  INTO  IE2000  TOOLKITS 

Although  a  wide  variety  of  available  coding  algorithms  were  tested,  the  usable  "C"  codes  were  developed  for 

three  lossless  and  three  near-lossless  ones  only.  These  are  called  space-varying  mean  (SVM)  method,  fixed  differential 
pulse  code  modulation  (DPCM)  technique,  and  an  adaptive  DPCM  (ADPCM)  method.  Among  these,  the  SVM  and 
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DPCM  belong  to  the  category  of  existing  methods,  whereas  the  selected  ADPCM  technique  is  a  new  one.  The  SVM 
method  finds  a  rounded  and  equally  weighted  local  mean  (alternatively,  median)  of  a  pixel  from  its  four  nearest  causal 
neighbors  [5],  subtracts  it  from  the  original  pixel  value  to  calculate  the  residual,  and  transmits  the  residuals  to  the 
receiver.  The  DPCM  technique  [5],[13]  is  similar  to  SVM,  but  uses  an  unequally  weighted  local  mean  of  only  three 
nearest  causal  neighbors  of  a  pixel.  The  ADPCM  technique  is  a  new  suboptimal  technique,  the  discussion  of  which  is 
postponed  until  Section  4. 

Each  of  the  above  coders  consists  of  two  basic  modules:  a  compressor  and  an  expander.  The  compressor 
module,  in  turn,  consists  of  three  sub-modules;  namely,  a  residual  generator,  an  integer-to-bit-stream  converter,  and  a 
shifted  Huffman  encoder.  The  residual  generator  constructs  integer  residual  values  using  any  one  of  the  algorithms 
mentioned  earlier,  the  integer-to-bit-stream  converter  converts  the  residuals  first  into  a  string  of  ASCII  characters  and 
then  into  a  bit-stream,  and  finally,  the  shifted  Huffman  encoder  encodes  the  residual  bit-stream  using  either  a  plain 
Huffman  coder  or  an  adaptive  one.  Similarly,  the  expander  module  consists  of  three  sub-modules;  namely,  a  shifted 
Huffman  decoder,  a  bit-stream-to-integer  converter,  and  an  image  reconstructor,  the  purposes  of  which  are  self- 
explanatory. 

There  are  two  program  sub-modules  that  are  shared  by  all  the  coders;  namely,  huff.c  and  bitio.c,  which  are 
used  for  Huffman  coding/decoding  and  input/output  of  bit-streams,  respectively.  The  remaining  modules  are: 
svmean-c.c  (for  SVM  compression  algorithm), 
svmean-e.c  (for  SVM  expansion  algorithm), 
fdpcm-c.c  (for  fixed  DPCM  compression  algorithm), 
fdpcm-e.c  (for  fixed  DPCM  expansion  algorithm), 
badpcm-c.c  (for  backward  ADPCM  compression  algorithm), 
badpcm-e.c  (for  backward  ADPCM  expansion  algorithm). 

The  compiled  programs  are  called  svmean-c,  svmean-e,  fdpcm-c,  fdpcm-e,  badpcm-c,  and  badpcm-e,  respectively.  Each 
of  the  above  programs  can  be  executed  as  follows: 
program-name  input  output, 

which  assumes  "input"  is  the  file-name  for  the  original  image  and  "output"  is  the  file-name  for  the  compressed  image. 
The  programs  also  ask  for  the  row  and  column  dimensions  of  the  input  image,  because  these  are  currently  meant  to  be 
usable  with  raw  images  only. 

The  near-lossless  versions  of  the  above  coders  are  organized  in  an  exactly  similar  fashion,  and  the  executable 
programs  are  called  svmean-nl-c,  svmean-nl-e,  fdpcm-nl-c,  fdpcm-nl-e,  badpcm-nl-c,  and  badpcm-nl-e,  respectively. 
The  usage  of  these  programs  is  similar  to  their  lossless  counterparts,  except  that  in  this  case  the  user  has  to  specify  the 
maximum  desired  error  as  well. 

In  terms  of  relative  performance  of  the  three  lossless  programs,  usually  badpcm  performs  slightly  better  than 
fdpcm,  whereas  svmean  performs  somewhat  poorly.  However,  the  current  version  of  badpcm  is  also  the  slowest  among 
the  three,  because  it  contains  a  number  of  redundant  arithmetic  calculations  which  should  be  removed  to  generate  an 
optimized  version  of  the  program.  Finally,  what  was  said  above  about  the  relative  performance  of  the  three  lossless 
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coders,  is  also  true  for  the  near-lossless  ones.  However,  in  this  case,  usually  badpcm  performs  somewhat  better  than  the 
other  two. 

4.  NEW  TECHNIQUES  FOR  LOSSLESS/NEAR-LOSSLESS  IMAGE  COMPRESSION 

As  mentioned  before,  during  the  course  of  this  study,  two  new  techniques  for  lossless  and  near-lossless  image 
compression  were  developed.  These  are  called  suboptimal  adaptive  DPCM  (SADPCM)  and  hierarchical  block-adaptive 
DPCM  (HBADPCM),  respectively.  First  SADPCM  is  discussed  below,  which  is  followed  by  HBADPCM. 

4.1  A  Suboptimal  Adaptive  DPCM  Image  Modeling  and  Estimation  Approach 

To  begin  with,  we  assume  that  a  two-dimensional  (2-D)  digitized  image  can  be  regarded  as  a  nonstationary  2-D 
signal  consisting  of  pixel  intensity  values,  {f(ij),  l<i<L,  l<j<L],  where  i  denotes  the  row  index  and  j  stands  for  the 
column  index,  respectively.  A  conventional  2-D  DPCM  compression  scheme  uses  an  image  model  of  the  following 
form: 

f(ij)  -  a,f(i,j-l)  -  a2f(i-l,j)  +  a,a2f(i-l,j-l)  =  w(ij),  (1) 

where  ai  and  a2  denote  the  model  coefficients,  and  w(i,j)  denotes  the  modeling  error,  which  is  usually  regarded  as  a  zero- 
mean,  2-D  white  noise  sequence.  In  case  of  conventional  fixed  DPCM  schemes  [13],  it  is  customary  to  set  a!  =  a2  and  a 
commonly  used  value  for  both  coefficients  is  0.95  [13].  Henceforth,  this  scheme  is  referred  to  as  the  fixed  DPCM 
technique. 

In  order  to  develop  a  suboptimal  adaptive  DPCM  (SADPCM)  image  model,  first  rewrite  (1)  in  the  form  of  a 
multiplicative  autoregressive  model  [14], 

(1  -  a2q,'‘)(l  -  aiq2'‘)  f(i,j)  =  w(ij),  (2) 

where  qf1  and  q2 1  denote  the  unit  backward  shift  operators  in  the  vertical  (i.e.,  row-wise)  and  horizontal  (i.e.,  column¬ 
wise)  directions,  respectively.  Because  of  the  multiplicative  nature  of  the  polynomial  operator  in  the  left  side  of  (2),  we 
can  further  express  (2)  in  the  cascade  form, 

(1 -aIq21)f(i,j)  =  f1(i,j),  (3a) 

(1  -  a2q,'!)  fj(ij)  =  w(i,j),  (3b) 

where  fi(i,j)  is  an  intermediate  signal  regarded  as  the  output  of  the  first  stage  of  the  cascade  structure. 

In  order  to  be  useful  for  adaptive  image  coding,  the  coefficients  at  and  a2,  of  model  (3)  must  be  estimated  from 
the  given  image  data.  As  discussed  in  [14],  an  optimal  method  of  estimating  a{  and  a2  involves  either  minimization  of  a 
nonlinear  cost  function,  or  a  pseudo-linearization  approach.  However,  both  of  these  approaches  are  beset  with 
moderately  high  computational  cost,  and  therefore,  they  are  unsuitable  for  our  purpose.  Instead  we  use  a  suboptimal 
estimation  scheme,  as  described  below. 

4.1.1  A  suboptimal  scheme  for  estimation  of  the  model  coefficients 

The  selection  of  a  suboptimal  estimation  scheme  depends  on  the  trade-off  between  performance  and 
computational  complexity.  For  the  simplest  coders,  presented  in  this  section,  we  use  a  fixed  value  for  a\  and  an  estimated 
value  for  a2,  whereas  for  the  improved  coders,  presented  in  a  subsequent  section,  estimated  values  of  a\  and  a2  are 
utilized.  To  find  the  estimates  of  both  a*  and  a2>  we  used  the  following  three-step  suboptimal  strategy: 

Step  1.  Estimate  a!  from  (3a)  ignoring  the  correlation,  if  any,  between  fj(i,j)  and  f(i,j); 
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Step  2.  Using  the  estimated  value  of  ah  evaluate  the  residuals  of  the  first  stage  to  form  an  estimate,  fi(ij),  of 
f,(Lj)  as, 

?,(ij)  =  f(i,j)-a,f(i,j-l); 

Step  3.  Finally,  estimate  a2  from  (3b). 

Notice  that  the  above  procedure  is  a  suboptimal  one  because  the  estimate  of  aj  is  biased  due  to  non-zero  correlation 
between  f(i,j)  and  fi(i,j).  However,  it  is  still  useful  for  predictive  coding  because  of  two  reasons:  i)  in  a  predictive  coder, 

the  modeling  inadequacies  are  masked  to  a  large  extent  by  adding  the  quantized  residual  errors  back  to  the  predicted 

pixel  values,  and  ii)  the  error  cannot  grow  provided  the  estimated  model  is  stable. 

Using  the  above  strategy,  estimates  of  a]  and  a2  are  simply  obtained  as, 

a,  =  rf(0,l)/rf(0,0),  (4a) 

a2  =  rn(l,0)/rfl(0,0),  (4b) 

where  Tf(k,l)  and  rn(k,l)  denote  the  autocorrelation  coefficients  with  'lag  (k,l)'  of  f(i,j)  and  f](i,j),  respectively.  Notice  that 
for  raw  image  data,  the  value  of  almost  always  lies  between  0.95  and  1.0,  which  represents  the  typical  range  of 
normalized  column-wise  autocorrelation  coefficients  for  most  images.  Thus,  for  the  simplest  SADPCM  coders,  which 
process  f(i,j)  directly,  the  value  of  a\  can  be  assumed  to  be  a  constant.  In  our  first  set  of  SADPCM  experiments,  the 
value  of  a!  was  kept  fixed  at  0.99,  and  only  a2  was  estimated  using  steps  (2)  and  (3)  mentioned  above.  It  may  be 
mentioned  that  the  choice  of  a  fixed  value  for  a!  is  not  very  critical;  in  fact,  only  slight  differences  in  the  overall 
performance  were  noticed  by  assigning  different  constant  values  to  aj  over  the  range,  [0.97, 1.0). 

Next,  notice  that  the  estimation  of  a2  can  be  carried  out  either  in  a  forward,  or  backward  fashion.  In  the  forward 
scheme,  the  image  is  first  subdivided  into  smaller,  say  MxM,  blocks,  and  a  separate  set  of  coefficients  is  estimated  for 
each  block.  In  this  case,  the  blockwise  estimates  of  a2  need  to  be  transmitted  as  the  side  information.  On  the  other  hand, 
in  the  backward  scheme,  the  estimation  of  a2  is  carried  out  over  a  causal  window  consisting  of  a  few  reconstructed  pixel 
values,  and  therefore,  avoids  the  necessity  of  transmitting  any  side  information.  The  overall  coding  schemes  based  on 
forward  and  backward  estimation  techniques  are  henceforth  referred  to  as  forward  ADPCM  (FADPCM)  and  backward 
ADPCM  (BADPCM),  respectively. 

In  the  forward  scheme,  two  alternative  estimation  strategies  are  useful;  namely,  batch  and  recursive  methods. 
The  batch  method  involves  blockwise  estimation  of  rn(l,0)  and  rn(0,0),  followed  by  evaluation  of  a2  from  (4b).  The 
recursive  method,  on  the  other  hand,  involves  calculation  of  a2  using  a  recursive  estimation  algorithm,  such  as,  recursive 
least  squares  (RLS)  or  least  mean  squares  (LMS)  [27].  Since  we  need  to  estimate  a  single  parameter,  the  use  of  RLS  is 
preferred  because  it  converges  faster  and  involves  only  a  little  extra  computation  than  LMS.  The  RLS  estimation  update 
equations  can  be  summarized  as  follows: 


error  update: 

e(i,j)  =  f((i,j)  -  a2fi(i-l,j), 

(5a) 

covariance  update: 

Pu  =  p°/g. 

(5b) 

normalizing  gain: 

g=l+p°[f1(y)]2, 

(5c) 

parameter  update: 

a2u=a2°  +  puf1(i,j)e(i,j), 

(5d) 

where  the  subscripts  'o'  and  'u'  refer  to  the  old  and  the  updated  values  of  the  associated  variables. 
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In  the  backward  scheme,  the  use  of  a  recursive  estimation  technique  is  mandated,  and  two  alternative  strategies 
were  found  to  be  useful.  The  first  method  uses  either  RLS  or  LMS  to  update  the  a2  estimate  for  each  pixel.  In  the  second 
method,  first  the  values  of  rn(l,0)  and  rfl(0,0)  are  updated  over  a  sliding  causal  window,  and  then  pixel-by-pixel 
estimates  of  a2  are  calculated  from  (4b).  The  window  used  in  our  experiments  is  shown  in  Fig.  1 .  Using  this  window,  the 
estimated  values  of  rfI(0,0)  and  rn(l,0)  for  the  (i,j)Ih  pixel  are  obtained  as, 

rn(0,0)  =  (II  [f i  (i-m,j-n)]  2)/7 .0,  (6a) 

m,neS 

rn(l»0)  =  [fi(ij-l)fi(i-lj-l)  +  fi(i-l  j-l)fi(i-2j-l)  +  f1(i-l,j)f1(i-2,j)  +  f,(i-l,j+l)f,(i-2,j+l)]  /4.0,  (6b) 

where  S  =  {(m,n)  I  me  [0,2],  ne[-l,l],  and  n  #  0,  or  1  when  m=0}.  It  may  be  mentioned  that  because  of  the  partial 
overlap  of  the  windows  for  two  neighboring  pixels,  the  actual  computational  load  required  for  implementation  of  (6a) 
and  (6b)  is  only  about  five  multiplications  and  four  additions  per  pixel,  which  can  be  reduced  further  by  choosing  a 
window  of  smaller  size. 

In  both  forward  and  backward  approaches,  the  robustness  of  the  overall  scheme  is  greatly  improved  by  adding 
a  stability  check  for  a2,  the  estimated  value  of  a2,  at  every  step.  Notice  that  the  stability  of  the  estimated  model  is 
guaranteed  if 

la2l  <  1 .  (7) 

If  (5)  is  violated,  the  estimate  is  first  projected  inside  the  stable  zone  before  continuing  with  prediction  and  encoding 
steps. 

Next,  lossless  and  near-lossless  image  coding  schemes  based  on  the  above  SADPCM  modeling  approach  is 
presented  below. 

4.2  Lossless  and  Near-Lossless  Image  Coding  Using  SADPCM  Modeling  Approach 
4.2.1  Lossless  coding 

As  mentioned  earlier,  two  different  coding  schemes  are  possible;  namely,  FADPCM  and  BADPCM.  Since  they 
differ  only  in  the  estimation  methodology,  we  summarize  the  overall  coding  scheme  below  for  a  generic  SADPCM 
model  only. 

A  lossless  image  coder  based  on  the  SADPCM  model  consists  of  two  main  components,  namely,  a  predictor 
and  an  entropy  coder.  The  predictor  simply  uses  (1)  to  calculate  a  predicted  value  of  f(i,j)  as, 

f(ij)  =  0.99f(i,j-l)  +  a2f(i-l,j)  -  0.99a2f(i-l,j-l),  (8) 

where  ai  is  chosen  to  be  0,99  and  a2  denotes  the  estimated  value  of  a2.  Next,  f(i,j)  is  rounded  to  generate  the  integer 
predicted  values,  fr(ij),  i.e., 

fr(i,j)  =  R[f(i  J )],  (9) 

where  R[x]  denotes  the  nearest  integer  value  of  x.  The  residual  signals,  d(i,j),  are  then  obtained  as, 
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d(i,j)  =  f(i  j)  -  fr(i,j).  (10) 

Finally,  the  residual  sequence,  (d(i,j)},  is  entropy  coded  using  an  optimal  encoder,  e.g.,  adaptive  Huffman  [22], 
or  arithmetic  coding  [24]  and  the  coded  residuals  are  transmitted  to  the  receiver.  At  the  receiver  end,  f(i,j)  is  exactly 
reconstructed  by  first  synthesizing  fr(i,j)  using  (8),  (9)  and  then  utilizing  the  decoded  d(i,j)  to  obtain 

f(i,j)  =  fr(i,j)+d(ij).  (11) 

4.2.2  Near-lossless  coding 

As  mentioned  before,  the  near-lossless  coders  attempt  to  improve  the  compression  efficiency  of  lossless  coders 
by  allowing  a  controlled  loss  in  the  reconstructed  image.  A  specific  class  of  such  coders,  which  consists  of  a  lossless 
coder  followed  by  a  uniform  scalar  quantizer,  was  introduced  in  [25], [26].  Since  the  same  strategy  is  utilized  here  and 
details  are  available  in  [25],[26],  only  the  main  ideas  are  summarized  below. 

In  the  case  of  near-lossless  coders,  the  predicted  value  of  f(i  j)  is  obtained  as, 

f(ij)  =  0.99?(i,j-l)  +  a2f(i-l,j)  -  0.99a2f(i-l,j-l),  (12) 

where  f(m,n)  denotes  the  reconstructed  value  of  f(m,n).  Next,  the  prediction  error,  d(i,j),  and  its  quantized  value,  d(ij), 
are  computed  as 

d(i,j)  =  f(i,j)-R[fri,j)],  (13a) 

<f(ij)  =  Q[d(ij)],  (13b) 

where  Q[.]  denotes  a  center-clipping  quantizer  whose  input-output  relationship  is  dictated  by  the  maximum  allowable 
reconstruction  error  (MARE).  If  MARE  is  ±n,  Q  is  chosen  as, 

Q[k]  =  m  for  integer  ke  [m-n,m+n],  (14) 

where  m=0,  ±(2n+l),  ±(4n+l),  etc. 

Finally,  the  reconstructed  pixel,  f(i  j),  of  f(i,j)  is  obtained  as 

f(i,j)  =  R[f(i,j)]+d(i,j),  (15) 

A . . 

and  at  the  end,  after  repeating  the  above  steps  in  a  recursive  fashion,  the  sequence  { d(i,j)}  is  entropy  coded  and 

/V 

transmitted.  At  the  receiver,  the  reconstructed  image  is  obtained  recursively  by  first  computing  f(i,j)  from  (12),  rounding 

it  to  the  nearest  integer  and  then  adding  the  same  to  the  decoded  value  of  d(i,j),  as  in  (15).  This  compression  scheme  is  a 

near-lossless  one  because  by  choosing  'n'  in  equation  (15)  to  be  small,  a  nearly  perfect  version  of  the  original  image  can 
be  reconstructed  at  the  receiver  end. 

4.2.3  Entropy  coding  of  the  residua  Is 

As  mentioned  before,  the  residuals  of  both  lossless  and  near-lossless  coders  can  be  entropy  coded  using  either 
Huffman,  adaptive  Hufftnan,  or  arithmetic  coders.  In  the  initial  stage  of  our  experiments,  we  performed  a  comparative 
study  between  plain  Huffman  and  its  adaptive  version,  and  found  that  the  performance  of  plain  Hufftnan  is  only 
marginally  inferior  compared  to  its  adaptive  counterpart,  which  achieves  an  average  bit  rate  that  is  very  close  to  the  first 
order  entropy  of  the  residuals.  Therefore,  in  the  subsequent  experiments,  we  compared  performance  of  different  coders 
based  on  the  first  order  entropies  of  the  residual  images  only.  It  may  be  mentioned  that  a  similar  approach  has  been 
followed  by  other  researchers  [13], 
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4.3  Experimental  Results  Using  Simple  SADPCM  Coders 

For  a  comparative  performance  evaluation  of  the  simple  adaptive  lossless  coders  introduced  above  with  the 
existing  schemes,  we  chose  HINT  [13]  and  fixed  DPCM  [13]  because  they  are  known  to  perform  better  than  other 
simple  lossless  coders  [13], [14].  For  the  fixed  DPCM  method,  ai  and  a2  in  (1)  are  chosen  to  be  0.95.  For  the  HINT 
scheme,  we  implemented  the  4x4  version,  which  is  briefly  outlined  below. 

4.3. 1  Hierarchical  interpolation  (HINT)  method 

The  hierarchical  interpolation  (HINT)  [13]  is  a  multi-resolution  pyramid  coding  scheme  that  begins  with  a  low- 
resolution  version  of  the  original  image,  P0,  and  successively  generates  the  higher  resolutions  Pk,l<k<n,  using  noncausal 
interpolations.  The  lowermost  resolution,  P0,  is  entropy  coded  and  transmitted  first.  Thereafter,  in  a  hierarchical  fashion, 
the  interpolation  scheme  is  used  to  generate  estimates  of  the  unknown  pixel  values  of  Pk  by  calculating  the  average  of  its 
four  nearest  neighbors  obtained  from  P^.  The  estimates  are  rounded  to  their  nearest  integers  and  then  subtracted  from 
the  true  pixel  values.  The  difference  signals  pertaining  to  each  of  the  higher  resolutions,  Dk,  l<k<n,  are  also  entropy 
coded  and  transmitted.  Details  of  the  operations  can  be  found  in  [13]. 

As  far  as  encoding  of  the  difference  signals  is  concerned,  two  kinds  of  HINT  schemes  are  possible;  namely,  i)  a 
scheme  that  uses  only  one  codebook  for  coding  all  the  residual  errors,  which  is  simply  referred  to  as  HINT,  and  ii)  a 
scheme  that  uses  five  different  codebooks  for  coding  the  residual  errors  pertaining  to  different  resolutions,  which  is 
called  HINTS.  Although  HINTS  is  expected  to  perform  better  than  HINT,  the  former  requires  more  computation  and 
additional  side  information. 

4.3.2  Comparison  of  computational  complexity  of  SADPCM,  fixed  DPCM,  and  HINT/HINTS 

Notice  that  the  computational  complexity  of  SADPCM  is  only  marginally  higher  than  that  of  either  DPCM  or 
HINT.  Consider,  for  instance,  the  FADPCM  scheme  described  in  Section  2  above.  Using  the  autocorrelation  approach 
for  estimation  of  a2,  the  extra  computational  load  for  FADPCM  compared  to  fixed  DPCM,  or  HINT,  consists  of 
approximately  three  multiplications  and  three  additions  per  pixel.  The  comparison  between  SADPCM  and  HINTS 
becomes  more  difficult  because  of  the  fact  that  HINTS  requires  construction  and  transmission  of  five  codebooks  instead 
of  only  one.  Thus,  the  extra  computational  load  of  HINTS,  as  compared  to  HINT,  depends  upon  the  choice  of  the 
entropy  coding  algorithm. 

4.3.3  Comparative  experimental  results  of  FADPCM/BADPCM,  HINT,  and  fixed  DPCM 

First,  the  lossless  coding  resuits  are  compared.  Eight  images,  digitized  to  256  gray  levels,  are  chosen  for  this 
study.  Two  of  these  are  digitized  radiographs,  called  R1  and  R2,  respectively,  and  six  others  are  standard  test  images 
available  at  a  number  of  internet  sites.  Most  of  the  standard  pictures  in  our  experiments  are  taken  from  the  test-image 
database  at  the  RPI  site.  These  are  referred  to  as  Lena,  Pepper,  Jet,  Flower,  Cameraman,  and  Bridge,  respectively.  All 
the  pictures  selected  for  this  experiment,  with  the  exception  of  Flower,  Cameraman  and  Bridge,  are  (512x512)  in  size. 
The  picture.  Flower,  is  of  size  (480x512),  whereas  both  Cameraman  and  Bridge  are  of  size  (256x256).  In  this  study, 
performances  of  both  FADPCM  and  BADPCM  are  compared  with  those  of  fixed  DPCM,  HINT,  and  HINTS.  The 
block  size  chosen  for  FADPCM  is  32x32,  and  the  estimation  technique  used  include:  i)  block-by-block  autocorrelation 
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based  approach  for  FADPCM,  and  ii)  the  sliding  window  autocorrelation  approach  for  BADPCM.  It  may  be  mentioned 
that  virtually  identical  results  were  also  obtained  using  the  RLS  estimation  technique. 

As  mentioned  before,  the  performances  of  different  methods  are  compared  on  the  basis  of  the  average  bit  rates, 
as  measured  by  the  first  order  entropies  of  the  residuals  plus  the  side  information  bit  rates,  if  any.  Table  1  summarizes 
these  lossless  coding  results.  The  results  clearly  indicate  that  in  terms  of  lossless  compression  efficiency,  SADPCM 
generally  performs  better  than  the  other  methods  considered. 

Next,  we  compare  the  performance  of  near-lossless  coders  based  on  FADPCM,  BADPCM,  fixed  DPCM,  and 
HINT.  The  results  obtained  for  MARE  values  of  ±1,  and  ±2  are  shown  in  Table  2.  Once  again,  the  results  clearly 
indicate  that  the  proposed  techniques  generally  perform  better  than  the  fixed  DPCM  technique. 

4.4  Improved  SADPCM  Coding  Schemes 

The  purpose  of  this  section  is  to  demonstrate  how  the  coding  performance  of  the  above  SADPCM  coders  can 
be  improved  at  a  little  additional  computational  cost.  The  basic  idea  and  the  motivation  behind  it  are  summarized  below. 

Notice  that  most  image  data  are  nonstationary  in-the-mean,  which  is  usually  caused  by  the  combined  effect  of 
both  reflectance  and  lighting  variations  over  different  parts  of  an  object  or  scene.  Since  predictive  coders  work  best  on 
stationary  data,  it  makes  sense  to  attempt  to  improve  the  performance  of  the  proposed  SADPCM  coders  by  removing  the 
nonstationary  attributes  of  the  image  data. 

The  simplest  method  of  eliminating  nonstationarity-in-the-mean  consists  of  subtraction  of  an  estimated  local 
mean  from  each  pixel  value.  Since  the  SADPCM  coders  utilize  two  coefficients  associated  with  the  nearest  causal 
neighbors  of  each  pixel,  f(i,j),  it  makes  sense  to  obtain  the  staionary-in-the-mean  image  as  follows: 

fm(ij)  =  R[0.5*(  f(ij-l)  +  f(i-l,j) )],  (16a) 

fs(ij)  =  f(ij)  -  fm(ij),  (16b) 

where  fm(ij)  denotes  the  local  mean  at  location  (i,j),  R[.]  denotes  the  operation  of  rounding  to  the  nearest  integer  value, 
and  {fs(ij)}  denotes  the  stationary-in-the-mean  image  data. 

Next,  the  SADPCM  scheme  is  utilized  to  code  { fs(i,j) }  using  either  forward  or  backward  approach,  as 
discussed  earlier.  However,  in  this  case,  both  ai  and  a2  need  to  be  estimated  because  a  fixed  value  for  ai  cannot  be 
assumed  any  more.  The  modified  methods  are  henceforth  called  improved  FADPCM  (I-FADPCM)  and  improved 
BADPCM  (I-BADPCM)  schemes,  respectively.  Next,  some  lossless  compression  results  using  I-FADPCM  only  are 
given  in  the  following  subsection. 

4.4. 1  Experimental  results  using  l-FADPCM 

Table  3  shows  the  lossless  compression  results  on  the  same  set  of  eight  images  using  I-FADPCM.  For  the  sake 
of  comparison,  the  results  using  basic  FADPCM  are  also  quoted  there.  As  can  be  readily  seen,  the  improved  version  of 
FADPCM  performs  better,  albeit  with  some  extra  computational  complexity.  Similar  improvements  are  also  attainable 
using  I-BADPCM  rather  than  the  basic  BADPCM. 

Next,  HBADPCM  coders  are  discussed. 
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4.5  Hierarchical  Block-Adaptive  DPCM  Coders 

There  are  two  main  reasons  that  motivated  us  to  pursue  the  development  of  hierarchical  block-adaptive  DPCM 
(HBADPCM)  coders.  First,  we  wanted  to  adapt  SADPCM  coders  for  hierarchical  transmission  in  order  to  achieve  better 
compression  than  that  afforded  by  the  existing  simple  and  elegant  hierarchical  coders,  such  as,  HINT.  Second,  we 
wanted  to  develop  a  scheme  that  allows  the  user  to  access  different  areas  of  an  image  gradually  according  to  his/her 
preference.  As  far  as  the  first  goal  is  concerned,  since  predictive  coders  work  well  only  on  a  full  image  frame  or 
subsections  of  it,  and  performs  rather  poorly  on  subsampled  images,  we  propose  to  develop  a  block-by-block  encoding 
and  transmission  method.  This  also  seems  to  be  good  for  meeting  our  second  goal  because  an  user  can  selectively  access 
specific  blocks  in  any  desired  sequence. 

In  order  to  achieve  blockwise  hierarchical  transmission,  first  we  must  subdivide  the  image  into  a  number  of 
smaller,  say  MxM,  blocks.  The  next  thing  needed  is  adoption  of  a  suitable  hierarchical  framework  for  coding  and 
transmitting  different  blocks.  Although  a  variety  of  different  hierarchical  frameworks  could  have  been  chosen  for  this 
purpose,  we  picked  the  framework  of  HINT  in  this  study  mainly  because  of  its  excellent  performance  as  a  hierarchical 
lossless  coder.  Also,  as  shown  in  the  following  subsection,  this  framework  allows  a  nice  way  of  implementing  a 
backward  HBADPCM  coder. 

Following  [13]  then,  we  arrange  the  blocks  to  be  coded  in  the  hierarchical  order  illustrated  in  Fig.  2.  In  this 
figure,  the  blocks  are  marked  according  to  their  allotted  level  of  hierarchy.  Thus,  the  blocks  marked  "4"  denote  the  ones 
to  be  coded  and  transmitted  first,  followed  by  the  blocks  marked  "3",  and  so  on. 

Next,  the  coding  and  transmission  of  each  individual  block  is  carried  out  using  either  FADPCM  (which 
requires  blockwise  estimation  of  a2  only),  or  its  improved  version,  I-FADPCM  (which  requires  blockwise  estimation  of 
both  ai  and  a2).  Then  the  block-by-block  residuals  pertaining  to  each  hierarchical  level  are  encoded  using  an  entropy 
coder  and  the  coded  residuals  are  transmitted.  It  is  pointed  out  that  there  is  no  need  to  construct  separate  codebooks  for 
different  hierarchical  levels,  because  all  the  residuals  generated  by  FADPCM  or  I-FADPCM  exhibit  narrow  peaks  and 
can  be  efficiently  coded  using  a  single  codebook.  The  overall  coding  schemes  are  henceforth  referred  to  as  forward 
HBADPCM  (F-HBADPCM)  and  improved  F-HBADPCM,  respectively. 

Until  now,  our  only  use  of  the  HINT  framework  has  been  restricted  to  the  task  of  laying  out  a  hierarchy  of 
blocks.  As  expected,  fruitful  exploitation  of  the  above  framework  allows  further  improvement  of  the  HBADPCM  coder 
discussed  above.  One  such  improvement,  namely,  a  backward  HBADPCM  coder,  is  presented  below. 

4.5.1  A  backward  HBADPCM  scheme 

The  main  objective  of  a  backward  HBADPCM  scheme  would  be  to  modify  the  parameter  estimation  strategy 
such  that  transmission  of  the  block-by-block  model  coefficients  becomes  unnecessary.  In  this  study,  two  useful  strategies 
are  used  to  realize  the  above  goal;  namely,  i)  a  backward  estimation  scheme,  where  the  model  coefficients  pertaining  to 
a  block  are  estimated  from  the  past  reconstructed  pixel  values  within  that  block,  and  ii)  a  hierarchical  approximation 
scheme,  where  the  block-by-block  model  coefficients  are  first  laid  out  in  the  same  hierarchical  pattern  as  the  blocks 
themselves,  and  an  approximate  set  of  blockwise  model  coefficients  is  formed  through  hierarchical  interpolation  from 
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the  past  reconstructed  blocks'  coefficients.  This  set  of  approximate  coefficients  is  used  by  both  transmitter  and  receiver 
avoiding  the  necessity  of  transmittal  of  any  side  information. 

A  comparative  experimental  study  of  the  two  backward  HBADPCM  schemes  indicates  that  the  hierarchical 
approximation  scheme  performs  marginally  better  than  the  backward  approximation  one.  Because  of  this  and  the  fact 
that  the  latter  attempts  a  true  exploitation  of  the  HINT  framework,  we  choose  the  HINT  approximation  approach  here. 
However,  it  may  be  pointed  out  that  like  in  the  forward  case,  there  exist  two  versions  of  the  scheme;  namely,  one  based 
on  blockwise  estimates  of  a2  only,  and  the  other  based  on  blockwise  estimates  of  both  ai  and  a2.  The  overall  coding 
schemes  are  henceforth  referred  to  as  the  backward  HBADPCM  (B-HBADPCM),  and  improved  B-HBADPCM, 
respectively.  Finally,  it  is  pointed  out  that  like  in  the  case  of  SADPCM,  the  residuals  can  be  entropy  coded  using  either 
Huffman,  or  arithmetic  coders,  both  of  which  achieve  an  average  bit  rate,  which  is  very  close  to  the  first  order  entropy 
of  the  residulas.  Hence  the  performances  of  different  coders  were  evaluated  on  the  basis  of  the  first  order  entropies  of 
the  residual  images  only. 

Next,  one  important  thing  that  needs  to  be  addressed  relates  to  the  question  of  how  to  interpolate  for  the 
missing  blocks  so  that  higher  level  approximations  can  be  generated,  if  needed,  from  the  reconstructed  image  blocks 
pertaining  to  a  lower  hierarchical  level.  This  issue  is  addressed  in  the  next  subsection. 

4.5.3  A  scheme  for  interpolation  for  the  missing  blocks 

Notice  that  the  problem  of  interpolation  for  the  missing  blocks,  in  the  above  context,  bears  a  striking  similarity 
to  the  problem  of  error  concealment  in  case  of  loss  of  cells  during  transmission  [28], [29].  Thus,  techniques  used  for 
error  concealment  -  particularly,  the  spatial  domain  ones,  are  well  suited  to  our  task. 

In  view  of  above,  we  propose  to  use  a  simpler  version  of  the  projective  interpolation  technique  [29],  for 
demonstration  of  interpolation  for  the  missing  blocks.  In  the  projective  interpolation  scheme,  Jung  et  al  suggests  using 
bilinear  interpolation  adapted  to  the  edge  pattern  of  a  missing  block.  Essentially,  their  method  consists  of  four  steps; 
namely,  i)  examination  of  the  boundary  pixel  values  of  a  missing  block  for  possible  edge  patterns  within  the  missing 
one,  ii)  classification  of  a  missing  block  into  one  of  six  categories  depending  on  the  possible  edge  patterns,  iii) 
determination  of  the  interpolation  direction,  and  iv)  interpolating  the  missing  block  using  a  weighted  bilinear 
interpolation. 

In  this  study,  we  use  a  simpler  version  of  the  above  technique,  which  can  be  summarized  as  follows: 

•  For  each  pixel  within  a  missing  block,  examine  four  pairs  of  boundary  pixel  values  -  located  along  the 
horizontal,  vertical,  45°,  and  135°  directions; 

•  Identify  the  pair  possessing  minimum  absolute  difference  among  the  four,  and  perform  a  linear 
interpolation  in  the  corresponding  direction. 

Finally,  some  experimental  results  are  presented  in  the  following  section. 

4.6  Experimental  Results  Using  HBADPCM 

In  this  section  we  present  some  experimental  results  comparing  the  performance  of  the  proposed  coders  with 
that  of  HINT  [13].  Before  presentation  of  the  quantitative  results,  however,  first  we  compare  the  computational 
complexities  of  the  methods  considered. 
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4.6. 1  Comparison  of  computational  complexity  ofHBADPCM  and  HINT 

Notice  that  HBADPCM  requires  estimation  of  model  coefficients.  Thus,  its  computational  complexity  is 
slighdy  higher  than  that  of  HINT.  Consider,  for  instance,  the  basic  forward  HBADPCM  coder,  F-HBADPCM.  Using 
the  autocorrelation  approach  for  estimation  of  a7,  the  computational  load  due  to  parameter  estimation  consists  of 
approximately  three  multiplications  and  three  additions  per  pixel.  This,  however,  gets  approximately  doubled  for  the 
improved  F-HBADPCM,  which  requires  blockwise  estimation  of  both  ai  and  a2. 

On  the  other  hand,  it  may  be  pointed  out  again  that  the  proposed  coders  do  not  require  multiple  codebooks  for 
transmission  of  the  residuals  pertaining  to  different  hierarchical  levels.  This  constitutes  one  of  the  key  advantages  of  the 
proposed  coders  over  HINT,  because  the  latter  does  not  perform  well  without  separate  codebooks.  To  stress  this  point 
further,  experimental  results  are  given  for  both  single  codebook  and  five  codebook  versions  of  HINT.  The  five 
codebook  version,  which  uses  separate  codebooks  for  different  hierarchical  levels,  is  henceforth  referred  to  as  HINTS. 

4.6.2  Comparative  experimental  results 

In  this  comparative  study,  the  test-images  used  are  same  as  the  ones  utilized  earlier  for  the  SADPCM  coders. 
The  block  size  chosen  for  all  HBADPCM  coders  is  32x32.  For  the  basic  forward/backward  HBADPCM  coders,  the 
fixed  value  of  ai  is  chosen  to  be  0.99,  and  the  autocorrelation  approach  is  used  for  estimation  of  a2.  For  the  improved 
forward/backward  HBADPCM  coders,  both  a]  and  a2  were  estimated  using  the  autocorrelation  approach.  It  may  be 
mentioned  that  virtually  identical  results  were  also  obtained  using  the  RLS  estimation  technique. 

As  mentioned  before,  the  performances  of  different  methods  are  compared  on  the  basis  of  the  attainable 
average  bit  rates,  as  measured  by  the  first  order  entropies  of  the  residuals  plus  the  side  information  bit  rates,  if  any.  Table 
4  summarizes  these  lossless  coding  results.  The  results  clearly  indicate  that  in  terms  of  lossless  compression  efficiency, 
HBADPCM  coders  perform  better  than  HINT. 

Finally,  Figs.  (3a)  and  (3b)  demonstrate  the  effectiveness  of  the  simple  interpolation  scheme  for  the  missing 
blocks,  presented  in  Section  4.5.3  above.  Fig.  (3a)  depicts  the  missing  blocks  of  Pepper  in  Level  1,  assuming  a  block 
size  of  8x8,  whereas  Fig.  (3b)  shows  the  interpolated  picture.  As  is  apparent  from  these  figures,  even  a  simple 
interpolation  scheme  produces  reasonably  good  quality  of  interpolated  pictures.  Of  course,  block  sizes  have  to  chosen 
small  for  achieving  good  results. 

5.  Conclusion 

Three  objectives  are  achieved  in  this  study.  First,  "C"  codes  for  some  existing  lossless  and  near-lossless  image 
coding  schemes  are  developed.  Second,  two  new  classes  of  lossless/near-lossless  and  hierarchical  coders  are  developed. 
Finally,  a  comparative  performance  evaluation  of  all  the  methods  is  undertaken.  The  results  of  the  comparative 
experimental  studies  indicate  that  the  new  methods  perform  better  than  hierarchical  interpolation  (HINT)  and  fixed 
DPCM  methods.  Particularly,  the  improved  SADPCM  coders  and  their  hierarchical  versions  perform  significantly  better 
than  the  other  methods  considered.  Since  the  computational  complexity  of  SADPCM  is  only  slightly  higher  than  fixed 
DPCM,  and  the  same  holds  true  for  HBADPCM  as  compared  to  HINT,  it  is  believed  that  SADPCM  and  HBADPCM 
coders  should  find  widespread  use  in  applications  involving  fast  lossless/near-lossless,  and  hierarchical  compression. 


9-14 


Acknowledgement 

The  author  gratefully  acknowledges  the  help  and  cooperation  affoided  by  all  the  following  personnel  affiliated 
with  the  Rome  laboratory: 

Mr.  Shawn  Bisgrove,  Mr.  Ed  Bohling,  Ms.  Audrey  Copperwheat,  Mr.  Chuck  Ferrara,  Mr.  Jerry  Nethercott, 

Mr.  Patrick  O’Connor,  Mr.  David  Quinn-Jacobs,  Mr.  Fred  Rahrig,  Mr.  John  Reilly,  Mr.  Mark  Rosiek,  Mr. 
Richard  Simard,  and  Mr.  Paul  Smith.  Without  their  help,  this  study  could  never  have  been  completed. 

References 

[1]  A.  K.  Jain,  "Image  Data  Compression:  A  Review,"  Proceedings  of  the  IEEE,  Vol.  69,  pp.  349-389,  March 
1981. 

[2]  M.  Rabbani,  editor.  Selected  Papers  on  Image  Coding  and  Compression,  SPEE  Optical  Engineering  Press, 
Bellingham,  Washington,  1992. 

[3]  W.  B.  Pennebaker,  JPEG  Still  Image  Data  Compression  Standard,  Van  Nostrand  Reinhold,  New  York,  1993. 

[4]  R.  B.  Arps  and  T.  K.  Truong,  "Comparison  of  International  Standards  for  Lossless  Still  Image  Compression," 
Proc.  IEEE,  Vol.  82,  No.  6,  pp.  889-899,  JUne  1994. 

[5]  A.  K.  Jain,  Fundamentals  of  Digital  Image  Processing,  Prentice  Hall,  Englewood  Cliffs,  NJ,  1989. 

[6]  A.  Gersho  and  R.  M.  Gray, "  Image  Coding  Using  Vector  Quantization,"  Proc.  IEEE  International  Conference 
on  Acoustics  Speech  and  Signal  Processing,  pp.  428-431,  April  1982. 

[7]  H.  M.  Hang  and  J.  W.  Woods,  "Predictive  Vector  Quantization  of  Images,"  IEEE  Transactions  on 
Communications,  Vol.  COM-33,  pp.  1208-1219,  November  1985. 

[8]  P.  J.  Burt  and  E.  H.  Adelson,  "The  Laplacian  Pyramid  as  a  Compact  Image  Code,"  IEEE  Transactions  on 
Communications,  Vol.  COM-31,  No.  4,  pp.  532-540,  April  1983. 

[9]  M.  Todd  and  R.  Wilson,  "An  Anisotropic  Multi-Resolution  Image  Data  Compression  Algorithm,"  Proc.  IEEE 
International  Conference  on  Acoustics  Speech  and  Signal  Processing,  pp.  1969-1972,  April  1989. 

[10]  S.  R.  Burgett  and  M.  Das,  "Predictive  Image  Coding  Using  Multiresolution  Multiplicative  Autoregressive 
Models,"  DEE  Proceedings  -  Part  I,  Vol.  140,  No.  2, 1993. 

[11]  J.  W.  Woods  and  S.  D.  O'Neil,  "Subband  Coding  of  Images,"  IEEE  Transactions  on  Acoustics,  Speech  and 
Signal  Processing,"  Vol.  ASSP-34,  pp.  1278-1288,  October  1986. 

[12]  S.  R.  Burgett  and  M.  Das,  "Predictive  Image  Coding  Using  Two-Dimensional  Multiplicative  Autoregressive 
Models,"  Signal  Processing,  Vol.  31,  No.  2,  1993. 

[13]  P.  Roos  et  al,  "Reversible  Intraffame  Compression  of  Medical  Images,"  IEEE  Transactions  on  Medical 
Imaging,  Vol.  7,  No.  4,  pp.  328-336,  1988. 

[14]  M.  Das  and  S.  Burgett,  "Lossless  Compression  of  Medical  Images  Using  Two-Dimensional  Multiplicative 
Autoregressive  Models,"  IEEE  Trans,  on  Medical  Imaging,  Vol.  12,  No.  4,  pp.  7221-726,  1993. 

[15]  M.  Das  and  C.  C.  Li,  "Simple  Space-Varying  Least  Squares  Model  for  Lossless  Medical  Image  Compression," 
Electronics  Letters,  Vol.  30,  No.  1 1,  pp.  849-850, 1994. 


9-15 


[16]  N.  Tavakoli,  "Lossless  Compression  of  Medical  Images,"  Proceedings  of  Fourth  Annual  IEEE  Symposium  on 
Computer-Based  Medical  systems,  pp.  201-207, 1991. 

[17]  V.  K.  Heer  and  H-E  Reinfelder,  "A  Comparison  of  Reversible  Methods  for  Data  Compression,"  SPIE  Vol. 
1233,  Medical  Imaging  IV:  Image  Processing,  pp.  354-365,  1990. 

[18]  H.  Blume  and  A.  Fand,  "Reversible  and  Irreversible  Image  Data  Compression  Using  S-Transform  and  Lempel- 
Ziv  Coding,"  SPIE  Vol.  1091,  Medical  Imaging  HI:  Image  Capture  and  Display,  pp.  2-17, 1989. 

[19]  L.  Wang  and  M.  Goldberg,  "Comparative  Performance  of  Pyramid  Data  Structures  for  Progressive 
Transmission  of  Medical  Imagery,"  SPIE  Vol.  1232,  Medical  Imaging  IV:  Image  Capture  and  Display,  pp. 
403-413, 1990. 

[20]  L.  Wang  and  M.  Goldberg,  "Reduced-Difference  Pyramid:  a  Data  Structure  for  Progressive  Transmission," 
Optical  Engineering,  Vol.  28,  No.  7,  pp.  708-716, 1989. 

[21]  D.  A.  Huffman,  "A  Method  for  Construction  of  Minimum  Redundancy  Codes,"  Proc.  IRE,  Vol.  40,  pp.  1098- 
1101,1952. 

[22]  R.  G.  Galleger,  "Variations  on  a  Theme  by  Huffman,"  TF.F.F.  Transactions  on  Information  Theory,  Vol.  24, 
No.6,  pp.  668-674,  1987. 

[23]  A.  Lempel  and  J.  Ziv,  "Compression  of  Two-Dimensional  Images,"  Combinatorial  Algorithms  on  Words,  pp. 
1410-154,  Springer-Verlag,  1985. 

[24]  I.  H.  Witten,  R.  M.  Neal,  and  J.  G.  Cleary,  "Arithmetic  Coding  for  Data  Compression,"  Communications  of  the 
ACM,  Vol.  30,  No.  6,  pp.  520-540,  June  1987. 

[25]  M.  Das  and  D.  L.  Neuhoff,  "Near-Lossless  Compression  of  Digitized  Images,"  Communications  and  Signal 
Processing  Lab  Report  #283,  University  of  Michigan  at  Ann  Arbor,  June  1993. 

[26]  M.  Das,  D.  L.  Neuhoff,  and  C.  L.  Lin,  "Near-Lossless  Compression  of  Medical  Images,"  Proc.  IEEE 
International  Conf.  on  Acoustics  Speech  and  Signal  Processing,  held  in  Detroit,  Michigan,  1995. 

[27]  L.  Ljung  and  T.  Soderstrom,  Theory  and  Practice  of  Recursive  Identification ,  MIT  Press,  Cambridge,  1983. 

[28]  A.  Narula  and  J.  S.  Lim,  "Error  Concealment  Techniques  for  an  All-Digital  High-Definition  Television 
System,"  SPIE,  Vol.  2094,  pp.  304-315, 1993. 

[29]  K-H  Jung,  J-H  Chung,  and  C.  W.  Lee,  "Error  Concealment  Technique  Using  Projection  Data  for  Block-Based 
Image  Coding,"  SPIE,  Vol.  2308,  pp.  1466-1476, 1994. 


9-16 


Fig.  1.  Causal  window  for  estimation  of  rn(l,0)  and  rfl(0,0) 
for  each  pixel.  □  denotes  current  pixel,  X  denotes  window  pixels. 
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Figure  2.  Hierarchical  ordering  of  blocks.  Blocks  "4"  are 
coded  first,  followed  by  blocks  "3",  and  so  on. 


Images 

Average  bit  rates  (in  bits/pixel)  using  different  lossless  coders 
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Table  1 .  Average  bit  rates  (in  bits/pixel)  using  different  lossless  coders 
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Images 

Average  bit  rates  (in  bits/pixel)  for  maximum 
error  of  ±  1 
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Table  2.  Average  bit  rates  (in  bits/pixel)  using  different  near-lossless  coders 
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Images 


Average  bit  rates  (in  bits/pixel)  using  different  coders 
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Table  4.  Average  bit  rates  (in  bits/pixel)  using  different  coding  schemes 


Figure  3a.  Missing  8x8  blocks  of  Pepper  in  Level  1 
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Figure  3b.  Interpolated  Pepper  from  available  blocks  of  Level  1 
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Abstract 

Fiber-optics  technology  has  become  a  cornerstone  for  advanced  information  communications 
in  both  the  military  and  the  civilian  sectors.  The  availability  of  mode-locked  fiber-optic  sources 
would  provide  compatibility  with  existing  fiber-optic  structures  and  an  inexpensive  alternative 
to  sub-picosecond  semiconductor  sources  in  this  wavelength  regime.  In  this  program  we  have 
continued  to  develop  a  unique  capability  in  the  simulation  of  mode-locked  fiber-optic  sources.  I 
have  endeavored  to  examined  several  avenues  to  creating  reliable  mode-locked  laser  sources  and 
improving  their  behavior. 

I  have  also  proposed  a  novel  mode-locking  scheme  using  fiber  Bragg  gratings  that  is  being  ex¬ 
amined  for  implementation  in  the  laboratory.  The  research  is  a  collaboration  with  James  Theimer 
at  Rome  Labs  and  is  designed  to  support  ongoing  experiments  of  Reinhard  Erdmann  and  my 
student.  Walter  Kaechele,  who  working  is  at  Rome  Labs. 

Two  papers  were  prepared  and  submitted  for  publication  during  the  summer  program.  Con¬ 
ference  papers  have  been  accepted  at  the  Optical  Society  Annual  Meeting  and  further  conferences 
have  been  identified  to  disseminate  research  results.  Other  aspects  of  the  summer  research  are 
being  completed  after  the  program's  end. 
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1  Introduction 


Information  systems  have  become  one  of  the  fastest  growing  sectors  of  the  economy  and  its  need  for 
improving  military  capabilities  cannot  be  understated.  In  a  little  over  one  century  the  capability 
of  undersea  cables  to  communicated  has  risen  by  a  factor  of  more  than  109  (from  a  few  words  per 
minute  in  trans-oceanic  communications  of  the  1850’s[l]  to  GHz  data  rates  today).  The  demand 
for  more  bandwidth  in  communications  continues  to  drive  the  technology  toward  more  significant 
achievements.  The  continued  improvements  will  also  have  significant  impact  in  the  arenas  of  local 
area  networks  and  wide  area  networks,  which  will  also  place  further  demands  toward  improvements 
in  optical  communications. 

The  new  era  of  economic  revolutions  promises  to  be  information  intensive,  rather  than  labor 
intensive,  as  was  the  first  industrial  revolution.  In  the  first  industrial  revolution  work  done  by 
individuals  was  rendered  inefficient  and  animals  were  set  out  to  pasture  as  machines  took  over 
their  function.  Large  machines  and  the  expenditure  of  huge  amounts  of  power  are  now  relegated 
to  a  lesser  role,  while  intellect  or  human  creativity  and  information  take  on  a  primaxy  role.  In 
lightwave  communications  the  full  potential  of  the  information  age  can  be  achieved;  the  workhorse 
of  the  information  age,  the  laser,  will  bring  to  its  innovators  increased  productivity,  reliability  and 
commercial  profit. 

There  are  several  enabling  technologies  needed  for  this  development  among  them  are  rapid 
detectors,  short-pulse  sources  and  ultrafast  optical  switches.  Detectors  are  already  available,  but 
require  improvements  in  speed  and  sensitivity  to  meet  the  challenges  of  terabit  data  rates.  Optical 
switches,  on  the  other  hand,  have  not  yet  matured  as  a  usable  technology  and  have  remained  an 
adolescent  technology;  nevertheless,  they  form  a  critical  element  for  rapidly  getting  information 
ON  and  OFF  the  so-called  Information  Superhighway. 

There  are  several  sources  that  can  be  employed  in  these  networks  and  provide  the  workhorse 
necessary  to  drive  the  information  age.  Already  in  use  are  semiconductor  lasers,  which  have 
become  ubiquitous  in  our  daily  life  as  elements  in  laser  scanners  and  optical  disk  readers.  However, 
ultrafast  semiconductor  lasers  with  the  requisite  bandwidth  are  not  available  and  modulation  has 
been  relegated  to  expensive,  ultrafast  modulators. 

A  major  breakthrough  toward  the  eventual  development  of  high  data-rate  optical  communica¬ 
tions  occured  in  the  1980’s  with  the  discovery  that  erbium-doped  fibers  exhibit  gain  in  1.55  \i m 
wavelength  regime;  this  regime  is  of  particular  interest  for  long-distance  communications,  since 
fibers  exhibit  their  minimum  loss  in  this  regime.  Erbium-doped  fiber  amplifiers  promise  repeater¬ 
less  transmission  of  optical  data.  This  eliminated  the  need  for  expensive  electronic  repeaters  in 
long-distance  communications  systems.  Erbium-doped  fibers  are  also  an  essential  building  block 
of  mode-locked  fiber  lasers. 

Laser  diodes  (MOPA  -  Master  Oscillator- Power  Amplifier)  that  are  compact  and  efficient 
are  being  used  to  pump  erbium-doped  amplifier  systems.  This  same  technology  is  applied  to 
pump  mode-locked  fiber  lasers.  The  wide  bandwidth  available  for  amplifications  (about  30  nm) 
is  sufficient  to  create  pulses  approximately  as  short  as  70  fs  duration;  in  a  network  this  would 
correspond  to  THz  data  rates.  Erbium-doped  fiber  lasers  are  a  potential  ultrashort  pulse  source 
for  optical  communications  technologies [2,  3].  They  provide  high  repetition  rates  in  the  wavelength 
regime  corresponding  to  the  minimum  loss  of  silica  fibers.  Such  lasers  have  distinct  advantages,  for 
instance,  they  can  be  pumped  using  diode  lasers  and  they  are  compatible  with  fiber  technologies. 
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However,  they  suffer  from  stability  and  reproducibility  problems  because  of  their  complex 
dependence  on  the  polarization  of  the  light  in  the  laser  cavity  [4,  5,  6],  Recently,  this  problem  has 
been  addressed  and  new  cavity  designs  have  been  proposed  to  operate  with  ultrashort  pulses  while 
being  insensitive  to  environmental  changes  [7,  8,  9,  10,  11,  12,  13,  14].  The  new  cavity  designs 
incorporate  birefringent  fiber  amplifiers  as  elements  to  control  the  polarization  state  of  the  light 
in  the  fiber.  Polarization  mode-locking  combines  the  uses  of  birefringent  fibers  with  polarization 
elements  to  create  fast  saturable  absorber-like  action. 

Generally,  when  a  soliton  pulse,  which  has  a  well  defined  polarization  state,  propagates  through 
a  birefringent  fiber,  then  cross-  and  self-phase  modulation  effects  act  to  rotate  the  polarization 
state  of  the  pulse  in  addition  to  the  linear  birefringence  of  the  fiber.  When  that  fiber  is  an 
amplifier  though,  the  perturbation  of  the  pulse  phase  and  amplitude  can  be  large  enough  that 
the  polarization  varies  across  the  pulse.  In  this  situation,  the  mode-locking  element  no  longer 
optimally  functions  as  designed.  The  result  will  be  a  pulse  that  is  distorted  from  its  original 
shape. 

The  following  Section  is  a  discussion  of  mode-locked  fiber  lasers  using  three  examples  of  passive 
mode-locking  elements.  The  new  one  we  proposed  this  past  summer  is  the  Bragg  grating  fiber. 
Section  3  is  a  presentation  of  our  results  on  figure-eight  lasers;  we  have  developed  a  simulation 
technique  that  is  applicable  to  examining  the  steady-state  operation  of  these  lasers;  we  also  learned 
from  these  studies  that  a  technique  we  dubbed  dispersion  balancing  can  be  used  to  engineer  a  more 
robust  figure-eight  laser.  The  final  Section  is  devoted  to  a  brief  summary  and  some  comments 
about  future  work. 

2  Mode-locked  Fiber  Lasers 

Erbium-doped  fiber  lasers  (EDFL)  have  many  cavity  designs  and  operation  regimes.  Each  is 
being  explored  as  a  source  of  high  repetition  rate,  energetic,  ultrashort  pulses,  that  meets  the 
needs  of  future  communication  networks.  Ultrashort  pulses  are  generated  using  mode- locking 
techniques.  Active  mode-locking,  using  for  example  a  Mach-Zehnder  modulator  in  the  cavity, 
has  been  valuable  as  a  source  of  regularly  spaced,  high  repetition-rate  pulses  [15].  Passive  mode¬ 
locking,  using  a  fast  saturable  absorber,  has  produced  pulses  of  sub  picosecond  duration.  The 
pulses  in  each  case  are  soliton-like,  i.e.  hyperbolic  secant  shaped;  solitons  have  proven  to  be  robust 
against  the  presence  of  losses  and  amplification  in  fiber  transmission  systems;  i.e.  they  don’t  alter 
their  shape  in  the  presence  of  small  perturbations.  The  topic  of  soliton  transmission  in  optical 
fibers  has  rapidly  evolved  from  a  pure  research  topic  to  an  emerging  technology  through  a  series  of 
important  technological  breakthroughs  (overcoming  challenging  obstacles)  in  long-distance,  high 
bit-rate  communication  systems.  In  addition,  soliton  interactions  have  been  proposed  for  logic 
and  routing  devices,  which  perform  important  information  processing  tasks  [16]. 

A  short-pulse  laser  requires  saturable  absorber-like  action  to  assist  in  achieving  shorter  pulses. 
The  saturable  absorber  transmits  more  of  the  higher  intensity  regime  of  the  pulse  and  the  lower 
intensity  experiences  higher  losses.  There  are  now  several  fiber-optic  devices  that  can  be  applied  to 
achieve  the  fast  saturable  absorber-like  action  and  there  is  already  an  extensive  literature  developed 
for  EDFA  [2]  and  EDFL  [3].  The  optimum  configuration  of  the  devices  in  each  laser  design 
requires  a  detailed  analysis.  Recently,  the  combination  of  active  and  passive  mode-locking  has 
been  simulated  as  a  source  of  stable  pulse  trains  and  ultrashort  pulse  operation  [17];  Active  mode- 
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locking  is  an  effective  strategy  to  shorten  pulses  because  the  modulator  has  a  sharp  transmission 
function;  however,  there  are  physical  limitations  set  by  the  modulator’s  response  time;  perhaps  a 
few  picoseconds  could  be  attained,  but  with  the  introduction  of  passive  mode-locking  ultrashort 
pulses,  i.e.  less  than  1  picosecond  duration,  can  be  achieved.  By  further  modeling  and  simulations, 
the  pulse  shape,  height  and  energy  can  be  optimized. 

We  are  developing  reliable,  numerical  simulations  of  EDFL  using  three  types  of  passive  mode¬ 
locking  (PML).  As  previously  mentioned,  the  action  of  PML  relies  on  a  fast  saturable  absorber-like 
action.  Three  devices  that  perform  this  task  are:  nonlinear  loop  mirrors,  dual-core  fibers  and  fiber 
Bragg  gratings;  this  list  is  not  exhaustive.  In  the  following  we  briefly  discuss  each  type. 

2.1  Nonlinear  Loop  Mirrors 

The  nonlinear  optical  loop  mirror  (NOLM)  [18]  works  in  a  manner  similar  to  a  Sagnac  interfer¬ 
ometer.  A  length  of  fiber  is  formed  into  a  loop  and  the  two  ends  are  connected  to  a  four  port 
directional  coupler  device  that  splits  the  input  pulse  intensity  by  unequal  amounts  (splitting  ratios 
between  the  two  output  ports  are,  say,  a  and  1  —  a;  the  ratio  of  the  two  is  denoted:  a/(  1  —  a)  ) 
into  two  counter-propagating  directions  around  the  loop;  as  the  pulses  travel,  each  one  picks  up 
a  different  optical  path  difference  that  is  related  to  its  intensity.  If  the  intensity  of  both  pulses  is 
low,  then  the  phases  are  nearly  the  same  and  the  light  recombines  in  the  coupler  and  is  mostly 
reflected  back  into  the  same  port  as  the  input  pulse;  this  light  is  rejected  from  the  laser  cavity 
by  a  isolator.  As  the  intensity  is  raised  the  nonlinear  contribution  to  the  polarization  with  Kerr 
coefficient  x^3\ 

P  =  |x(3'|£|  2E,  (1) 

changes  the  optical  path  length  of  each  pulse  by  a  different  amount;  thus,  the  output  intensity 
increases  in  the  second  port.  A  related  device  is  the  nonlinear  amplifying  loop  mirror  (NALM) 
[19],  which  places  an  amplifier  at  one  end  of  the  loop.  In  this  case  a  50/50  coupler  can  be  used 
and  still  the  pulse  amplified  at  the  beginning  of  the  loop  will  experience  a  greater  optical  path 
difference  than  the  counter-propagating  pulse  that  is  amplified  after  passing  around  the  loop.  The 
transmission  through  the  second  (output)  port  for  an  NALM  is  given  for  a  CW  wave  by 

T2  =  G  (1  —  2a(l  —  a){l  +  cos  Ad>}) ;  (2) 

where  G  is  the  gain  and  the  phase  is  given  by 

=  (G(l  -  a)  -  a)  \Ein\22rn2L/\.  (3) 

The  new  parameters  are:  T,  the  length  of  the  loop;  A,  the  wavelength  in  vacuum;  n2  the  coefficient 
of  the  nonlinear  index,  which  is  related  to  the  Kerr  coefficient. 

The  transmission  of  the  NOLM  ( G  =  1  in  Eq.  (2)  has  a  transmission  maximum  for  the  value  of 
the  phase  shift  A<j>  =  r.  For  a  pulse  whose  intensity  is  less  than  the  maximum  on  the  transmission 
curve  higher  loss  is  experienced  for  lower  intensities  and  it  acts  like  a  fast  saturable  absorber.  For 
ultrashort  pulses  the  length  of  fiber  required  is  only  a  few  meters,  but  self-frequency  shifts,  due  to 
stimulated  Raman  scattering  limit  the  power  levels  of  the  PML  device.  In  truth,  Eq.  (2)  does  not 
accurately  describe  the  action  of  an  NOLM  or  NALM.  Detailed  numerical  analysis  needs  to  be 
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considered.  In  this  device  the  birefringence  of  the  fiber  plays  an  important  role  and  polarization 
controls  are  used  to  correct  for  the  distortion  of  the  polarization  state. 

Two  other  related  PML  devices  also  act  as  as  fast  saturable  absorbers:  the  dual-core  fiber  laser 
recently  proposed  In  Ref.(  [20]),  whose  length  has  been  chosen  as  a  half  beat  or  in  other  words  so 
that  light  input  in  one  core  exits  in  the  other  core,  and  the  birefringent  fiber  with  cross  polarizers 
[21,  22,  23.  24].  The  dual  core  fiber  distributes  the  loss  over  the  fiber  length  and  consists  of  a 
core  with  gain  enclosed  in  a  cavity,  while  the  second  core  is  open  to  reject  any  unwanted  CW  or 
pedestal  on  the  pulse;  this  is  the  essence  of  a  PML  device.  The  birefringent  fiber  with  polarizers 
will  not  be  further  discussed  here. 

2.2  Dual-core  fibers 

In  dual  core  fibers  the  field  can  couple  between  the  two  cores;  as  it  propagates;  in  the  linear 
regime,  like  a  pair  of  coupled  oscillators,  the  energy  periodically  passes  from  one  core  to  the  other. 
For  nonlinear  materials  the  situation  is  altered,  as  the  intensity  is  raised,  the  pulse  couples  less 
and  less,  until  it  remains  on  a  single  core;  this  effect  has  been  studied  for  use  as  a  nonlinear  switch 
beginning  with  the  work  of  Jensen  [25]  on  CW  operation.  In  the  nonlinear  dual-core  fiber  the 
soliton  is  special  again,  when  the  coupling  is  a  small  perturbation,  the  soliton  retains  its  shape 
and  the  pulse  is  not  cut  off  in  the  wings.  Recent  proposals  [26]  include  gain  and  loss  in  the  fibers 
to  improve  the  switching  characteristics,  i.e.  lower  power  and  shorter  fiber  lengths,  or  applications 
as  a  demultiplexer  or  narrow  bandpass  filter  [27]. 

Soliton  pairs  can  even  interact  between  the  cores  and  be  used  for  gating  and  logic  operations 
[28,  29],  much  like  the  interaction  between  two  orthogonally  polarized  solitons  used  for  dragging 
one  soliton  with  the  aid  of  a  second  one[16].  In  dual-core  fibers  Abdullaev  reported  the  existence 
of  an  attractive  and  repulsive  interaction  between  solitons  and  adjacent  fibers;  this  can  be  used  as 
the  basis  for  new  applications  in  pushing  or  pulling  solitons  within  some  time  window  or  spectral 
window.  One  such  application  is  the  logic  gate  operation  proposed  in  Ref.  [29]. 

The  same  properties  of  dual-core  fibers  that  permit  logic  and  switching  operations  also  make 
them  suitable  as  saturable  absorbers  in  a  fiber  laser  geometry  [30,  31].  The  fiber  length  chosen 
is  1/2  a  beat  length,  i.e.  the  length  chosen  in  the  linear  regime,  such  that  the  input  energy  in 
one  core  would  emerge  in  the  other  core,  to  remove  the  weak  continuous  wave  component  (i.e. 
removes  the  pulse  pedestal)  and  the  pulses  have  soliton-like  shapes.  Due  to  the  switching  charac¬ 
teristics  mentioned  above,  the  dual-core  fiber  has  the  characteristics  of  a  saturable  absorber.  In 
their  calculations  Winful  and  Walton  [20]  made  use  of  the  PML  property  of  the  dual-core  fiber. 
They  obtained  short  bandwidth  limited  pulses  using  this  scheme.  Our  investigations  incorporat¬ 
ing  Raman  scattering,  third-order  dispersion  and  saturation  show  that  the  pulse  is  qualitatively 
changed  [31].  The  incorporation  of  this  device  into  other  laser  cavity  designs  will  require  careful 
consideration  of  these  limitations,  as  well. 

The  combination  of  both  positive  and  negative  dispersion  in  single  core  fibers  has  been  applied 
to  increase  the  power  output  without  sacrificing  the  ability  to  shorten  the  pulse  width  or  using 
external  pulse  compression  techniques  [32].  For  single-mode  fibers  operating  at  1.55  // m,  the  dual¬ 
core  fiber  can  be  designed  to  have  a  half-beat  length  aro.und  1  m  and  typical  peak  power  threshold 
for  the  saturation  near  1  KW,  when  the  pulse  width  is  less  than  1  ps.  The  energy  in  the  pulses 
will  be  several  hundred  picojoules. 
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2.3  Fiber  Bragg  Gratings 

Fiber-optic  Bragg  gratings  have  been  fabricated  with  reflectivities  in  excess  of  99.8  %.  They  are 
written  into  the  fiber  by  exposing  a  photosensitive  core  to  ultraviolet  light.  The  depth  of  the  index 
difference  in  the  periodic  structure  can  be  as  much  as  10-2  and  this  is  sufficient  for  a  stop  band 
of  several  nm.  In  practice  the  transmission  function  is  strongly  dependent  on  the  wavelength  and 
on  the  short  wavelength  side,  light  is  lost  by  scattering  to  the  cladding. 

The  simplest  case  to  consider  is  a  multilayered  structure[33,  34].  It  exhibits  interesting  phe¬ 
nomena,  such  as,  high  reflectivity  over  a  frequency  range,  a  so-called  forbidden  band  or  stop 
band.  The  theory  has  a  strong  correspondence  with  the  quantum  theory  of  periodic  lattices,  so 
we  draw  heavily  on  physical  concepts  developed  in  solid  state  physics.  The  band  structure  for  a 
one-dimensional  structure  always  possesses  a  stop  band,  no  matter  how  small  the  index  difference. 
This  is  good  news  for  the  development  of  future  applications  of  these  devices  because  they  won’t 
place  emphasis  on  extreme  material  properties. 

The  nonlinear  dielectric  response  of  the  material  can  be  enhanced  through  the  electromagnetic 
resonances  at  stop-band  edges  in  the  medium.  The  use  of  periodic  dielectric  materials  can  con- 
trollably  enhance  the  local  field  using  interference  effects.  Periodic  structures  that  incorporate  a 
nonlinear  response  of  the  medium  exhibit  interesting  phenomena. 

Chen  and  Mills[35]  considered  a  periodic  structure  with  an  intensity-dependent  polarizability. 
When  the  light  frequency  is  tuned  inside  the  stop  gap  and  has  a  low  intensity,  the  transmissivity 
is  low,  as  expected.  As  the  intensity  is  increased  though,  the  transmissivity  begins  to  increase 
until  the  sample  is  transparent.  There  is  a  bistable  behavior  of  the  transmissivity  very  similar  to 
the  behavior  of  a  Kerr  nonlinearity  in  a  Fabry-Perot  cavity [36]. 

This  phenomenon  can  be  physically  explained  as  a  shift  of  the  dielectric  values  that  moves  the 
edge  of  the  stop  band  toward  the  excitation  frequency.  As  it  moves  out  of  the  stop  band,  the 
transmission  dramatically  changes. 


€ 

Figure  1:  The  spatial  profile  of  the  field  intensity  for  the  input  power  at  which  the  transmission 
becomes  unity.  The  oscillations  are  the  Floquet-Bloch  function  and  the  envelope  is  a  hyperbolic- 
secant  squared. 

Chen  and  Mills  also  discovered  that  the  field-envelope  inside  the  structure  has  a  hyperbolic 
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secant  shape,  which  is  a  well-known  form  of  soliton  solutions,  see  Fig.  1.  Thus,  they  named  the 
field  structure  a  gap  soliton.  As  with  many  such  numerical  discoveries,  the  simplicity  of  the  field 
envelope  suggested  that  it  would  be  possible  to  find  an  appropriate  analytical  theory  to  describe 
the  nonlinear  phenomena  in  periodic  structures. 

Two  types  of  experiments  have  reported  verification  of  the  gap-soliton  predictions.  Sankey  et 
al.[37]  use  a  corrugated  silicon-on-insulator  waveguide  geometry.  The  measurements  were  made 
using  a  Nd:YAG  laser  at  a  wavelength  of  1.06  pm.  The  pulse  duration  was  around  30  ns  and 
the  pulse  energies  were  in  the  p J  range.  The  carrier  dynamics  was  complicated,  but  was  also  the 
source  of  interesting  unstable  dynamical  evolution. 

The  experiments  of  Herbert  et  al.  [38]  used  a  dye-doped  colloidal  crystal  as  a  distributed  Bragg 
reflector.  They  use  a  cw  dye  laser  to  tune  across  the  stop  gap  and  ramp  the  intensity  while  holding 
the  frequency  constant.  The  lattice  spacing  was  around  215  nm  and  the  index  modulation  is  weak; 
there  were  about  400  periods  of  the  unit  cell.  Depending  on  the  tuning  position  within  the  gap, 
the  transmissivity  showed  a  monotonic  behavior,  bistable  or  multi-stable  operation. 

We  believe  that  this  nonlinear  optical  behavior  will  constitute  the  necessary  mode-locking 
characteristics  required  for  a  fiber  ring  laser.  However,  the  nonlinear  response  of  Bragg  gratings 
in  fibers  has  not  been  investigated  and  this  mode-locking  techniques  constitutes  and  unproven 
concept.  First,  the  theoretical  details  need  to  be  flushed  out.  That  task  was  begun  this  past 
summer  during  my  participation  in  the  Photonic  Lab  summer  faculty  program.  The  approach  to  a 
feasibility  study  involves  analytic  methods[39]  that  have  been  developed  to  study  gap  solitons[40, 
41,  42]  and  numerical  methods[43]  that  we  have  built  up  over  the  past  ten  years.  The  two 
approaches  are  designed  to  provide  reliable  answers  on  the  operation  of  a  mode-locked  fiber  laser 
using  nonlinear  Bragg  gratings  as  elements.  Completing  the  analysis  will  require  further  work 
and  research  is  continuing  on  this  project.  The  experimental  group  of  Reinhard  Erdmann  and 
Kenneth  Teagarten  are  involved  in  the  project.  The  project  also  uses  a  student,  Walter  Kaechele, 
who  is  developing  experiments  at  the  Photonics  Laboratory. 

3  Figure-eight  Laser  Model 

Figure-eight  lasers  using  erbium-doped  fiber  amplifiers  are  of  special  interest  due  to  their  ability 
to  produce  nearly  transform  limited  pulses  at  a  very  high  repetition  rate  and  wavelength  operation 
at  the  minimum  loss  region  of  the  optical  fiber.  Their  operation  was  first  demonstrated  in  1990 
[44,  45]  with  two  mode-locking  configurations.  One  design  uses  the  nonlinear  optical  loop  mirror 
(NOLM)[44]  and  the  other  uses  a  nonlinear  amplifying  loop  mirror  (NALM)  by  Richardson[46] 
and  by  Duling[45,  47].  Bulushev[48]  simulated  the  NOLM  based  laser  by  direct  integration  of 
the  nonlinear  Schrodinger  equation  (NSE)  in  the  NOLM,  but  no  propagation  in  the  amplifier, 
which  was  modeled  as  a  homogeneously-broadened,  saturable  gain  medium.  Tzelpis  et  al.[49] 
reported  a  similar  analysis  on  the  NALM  based  figure-eight  lasers.  They  can  also  be  analyzed  as 
additive-pulse  mode-locking  (APM)  lasers[50]. 

It  has  been  experimentally  found  that  periodic  perturbations  to  a  soliton-like  pulse  produce 
spectral  sidebands  [52,  53,  54,  55,  56].  The  research  of  Dennis  et  al.[56]  has  quantitatively  examined 
this  phenomenon.  The  sidebands  are  a  result  of  dispersive  wave  shedding  that  circulates  in  the 
cavity  and  is  amplified  along  with  the  pulse  when  the  wavelength  is  phase  matched.  The  operation 
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of  the  figure-eight  laser  (F8L)  is  sensitive  to  the  total  dispersion  in  the  cavity,  which  includes  the 
amplifier  section  of  the  laser. 


Figure  2:  Sketch  of  the  figure-eight  laser  geometry  showing  the  important  elements.  The  isolator 
in  the  amplifier  section  is  depicted  as  a  diode  symbol. 


Here  I  discuss  simulations  of  certain  operational  aspects  of  the  F8L  laser  with  an  NOLM  that 
were  completed  during  the  summer  program.  This  work  has  been  submitted  for  publication  in 
the  Journal  of  Lightwave  Technology [51].  The  emphasis  is  placed  on  the  role  that  dispersion  and 
propagation  play  in  the  laser  operation.  Pulse  propagation  is  described  by  a  modified  NSE  and 
an  initial  seed  pulse  is  injected  into  the  cavity.  The  basic  features  of  our  model  are  shown  in  the 
diagram  in  Figure  2.  The  NOLM  is  a  loop  of  fiber,  whose  length  is  initially  chosen  as  4  soliton 
lengths  in  scaled  units.  Two  pulses  propagate  in  opposite  directions  around  the  loop  and  by 
virtue  of  their  different  amplitudes,  they  have  different  nonlinear  phase  shifts  upon  recombining 
at  the  60/40  cross  coupler.  The  amplifier  section  allows  pulse  propagation  only  in  the  clockwise 
direction;  the  counter-clockwise  direction  is  suppressed  by  the  action  of  the  isolator.  The  output  of 
our  model  laser  is  a  10  %  coupler  placed  just  before  the  entrance  to  the  NOLM.  Pulse  propagation 
is  described  by  a  modified  NSE  and  an  initial  seed  pulse  is  injected  into  the  cavity.  Not  shown  is 
the  amplifier  pump. 

The  numerical  solution  is  obtained  by  means  of  the  split-step  Fourier  transform  method[57]. 
The  fiber  amplifier  was  modeled  as  a  two-level  system  with  a  parabolic  line  shape  as  in  Ref. [58]. 
We  consider  the  case  of  a  60/40  directional  coupler  between  the  two  cavity  sections  in  Figure  1 
and  a  10  %  output  coupler.  Initially,  the  NOLM  loop  is  four  soliton  periods  long  in  scaled  units. 
Duling[59]  found  that  this  length  produces  minimum  loss  and  pulse  distortion.  As  the  pulse  width 
shortens  during  successive  round  trips  in  the  cavity,  the  soliton  period  correspondingly  shortens. 
It  was  assumed  that  the  population  inversion  in  the  amplifier  was  uniform,  and  for  steady-state 
conditions  it  totally  recovers  between  passes  of  the  pulse  through  the  amplifier. 


.dE  1  d2E 
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,  +~  +  E\E\2  =  i^E  +  i^t  (4) 

dz  2  dr2  2  or2 

The  equations  have  been  scaled  to  soliton  units  throughout  [57].  The  time  is  scaled  to  a  value 

To,  related  to  the  initial  pulse  width:  the  length  is  scaled  by  the  dispersion  length  Lq  —  Tq/\/32\, 

where  ^2  is  the  group  velocity  dispersion  parameter:  it  is  negative  in  the  wavelength  regime  near  1.5 
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fim;  fl2  ~  —20  ps2/km.  An  often  used  length  related  to  Ad  is  the  soliton  length  whose  definition  is 
Z0  =  -kLdI'2.  The  field  amplitude  scaling  corresponds  to  a  fundamental  soliton,  72|i?o|2  =  \^\ /T02, 
where  72  is  related  to  the  fiber’s  Kerr  nonlinearity  and  the  fiber’s  effective  core  area.  The  gain 
parameter  G  is  a  variable  in  our  simulations  and  the  gain  dispersion  parameter  n  =  GT$  is  a 
product  of  the  gain  parameter  and  the  polarization  relaxation  time  we  used  is  T2  =  100/s  in 
physical  units,  which  is  appropriate  for  erbium-libers.  All  our  scaled  parameters  are  based  on 
scaling  T0  =  300/s. 

The  left  hand  side  of  Eq.  (4)  has  the  elements  of  dispersion  and  nonlinearity  required  for  pulse 
propagation  in  an  optical  fiber.  Our  pulse  lengths  are  long  enough  that  effects,  such  higher-order 
dispersion,  stimulated  Raman  scattering  and  self-steepening,  are  negligible.  This  portion  of  the 
evolution  equation  is  applied  to  propagation  of  the  two  pulses  in  the  NOLM. 

The  two  additional  terms  on  the  right  hand  side  of  Eq.  (4)  are  included  in  the  simulation  when 
the  pulses  propagate  through  the  amplifier  section  of  the  laser.  They  describe  a  gain  curve  with 
a  maximum  at  the  pulse’s  center  frequency  and  a  parabolic  gain  profile;  the  gain  profile  is  much 
wider  than  our  pulse  spectra  and  this  approximation  is  not  a  limiting  factor. 

The  saturation  energy  of  the  amplifier  can  be  given  by  Es  =  /ji/0ae///2<T  .  This  expression 
can  be  found,  for  example,  in  reference  [58].  In  this  expression  z/0  is  the  central  frequency  of 
the  pulse,  aefj  is  the  effective  core  area,  and  a  is  the  absorption  cross  section  of  the  laser  line. 
The  parameters  which  effect  saturation  energy  are,  therefore  either  not  under  our  control,  or  are 
severely  constrained  by  amplifier  design  criteria.  Typically,  saturation  energy  will  be  found  to  be 
around  10  mJ.  This  implies  that  the  values  of  saturation  energy  used  in  references  [48]  and  [49] 
were  far  too  large  to  be  physically  realistic. 

For  realistic  values  of  the  saturation  energy,  the  pulse  initiation  and  growth  from  noise  cannot 
be  simulated  because  it  would  require  thousands  of  round-trips  in  the  cavity  before  steady-state 
operation  was  achieved.  When  we  start  with  enough  gain  to  overcome  the  loss  due  to  the  NOLM 
at  low  intensities,  the  amplifier  will  hardly  have  saturated  at  all  when  the  pulse  has  become  intense 
enough  to  have  reached  the  first  transmission  peak  of  the  loop  mirror.  The  pulse  will  then  break 
up,  becoming  broad  in  time  and  frequency  space.  This  is  a  condition  that  our  current  model  does 
not  handle  well.  For  this  reason,  we  have  limited  ourselves  to  studying  conditions  for  steady-state 
pulse  operation. 


3.1  Results  for  the  Figure-eight  Laser 

Consider  first  the  effect  of  the  fiber  amplifier’s  length.  Altering  the  length  changes  the  amount  of 
dispersion  in  the  cavity,  which  also  limits  the  minimum  pulse  length  in  the  cavity.  We  found  that 
stable  pulses  could  not  be  produced  if  the  amplifier  was  longer  than  0.7-0. 8  dispersion  lengths; 
we  attribute  this  effect  to  reshaping  the  pulse  in  the  laser  amplifier,  since,  during  this  portion 
of  the  evolution,  the  pulse  has  a  higher  energy  than  a  fundamental  soliton  energy  and  the  pulse 
tends  to  shed  energy  while  it  evolves  toward  a  fundamental  soliton  (this  will  be  discussed  further 
below).  Since  the  NOLM  loop  is  4  soliton  periods  long,  this  implies  a  total  loop  length  of  about 
7.0  dispersion  lengths.  For  amplifiers  shorter  than  0.8  dispersion  lengths,  we  could  only  achieve 
stable,  single  pulse  operation  for  a  narrow  band  of  values  for  the  gain  coefficient.  If  the  gain 
was  either  too  great,  or  too  small,  the  pulse  would  decay  away.  These  findings  are  summarized 
Figure  3. 
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Figure  3:  Stable  operation  regime  for  a  single  pulse  in  the  figure-eight  laser. 

To  generate  the  results  in  Figure  3,  simulations  were  run  using  a  single  pulse  that  is  launched  in 
the  cavity.  For  each  amplifier  length  a  range  of  gain  parameter  values  was  used  and  steady-state 
operation  was  sought.  The  entire  range  of  stable  pulse  operation  was  determined  by  altering  the 
the  gain  and  amplifier  length  parameters.  In  Figure  3  we  plot  the  gain,  which  is  related  to  the 
previously  defined  gain  parameter  by 

T  =  eGLa.  (5) 

The  parameter  La  is  the  amplifier  length.  The  parameter  F  is  a  measure  of  how  much  the  intensity 
increases  after  one  trip  through  the  amplifier.  The  curves  represent  the  lower  and  upper  bounds 
for  stable  single-pulse  operation.  For  a  very  short  amplifier,  the  gain  only  needs  to  overcome  the 
loss  due  to  the  output  coupler.  Length  dependent  loss  has  been  discussed  in  references  [56]  and 
[60].  An  amplifier  length  around  0.75  dispersion  lengths  is  sufficient  to  cause  an  instability  in  the 
pulse  shape  and  its  subsequent  disappearance  due  to  a  loss  of  transmission  through  the  NOLM. 

We  also  found  a  maximum  amplification,  beyond  which  a  single  pulse  in  the  loop  will  not  be 
stable.  For  higher  gains  the  cavity  would  adjust  to  include  two  pulses  in  the  cavity.  The  effect  of 
a  second  pulse  in  the  loop  will  be  to  cut  the  gain  coefficient  in  half,  which  would  reduce  the  gain 
to  the  point  where  stable  pulses  could  again  be  formed.  This  implies  a  pump  dependent  transition 
from  a  regime  with  one  stable  pulse  in  the  loop,  to  one  with  multiple  pulses.  Such  a  transition  is 
observed  in  NALM  based  lasers. 

Pulse  energy  evolution  depends  upon  the  gain  parameter.  One  gain  is  near  the  near  minimum 
amplification  boundary;  the  pulse  amplitude  is  quite  steady  and  varies  little  in  each  successive 
round  trip.  For  the  gain  near  the  maximum  amplification  boundary,  the  energy  in  the  pulse  decays 
and  recovers;  the  pulse  takes  a  much  greater  number  of  round  trips  to  settle  to  a  steady  state.  In 
this  latter  case,  the  pulse  shape  undergoes  considerable  change  during  its  evolution. 

Physical  insight  into  these  phenomena  can  be  found  by  examining  a  transmission  curve  for  the 
NOLM,  given  in  Figure  4.  The  curve  is  generated  using  hyperbolic-secant  shaped  input  pulses; 
input  pulse  amplitudes  near  2  closely  correspond  to  fundamental  solitons  in  the  NOLM,  where 
the  transmission  curve  has  a  strongly  peaked  maximum.  The  minimum  gain  limit  corresponds 
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Figure  4:  The  hyper-secant-pulse  transmission  curve  for  a  nonlinear  optical  loop  mirror.  The 
splitter  has  a  60/40  splitting  ratio. 


to  the  amplification  required  to  overcome  losses  in  the  laser,  so  greater  gain  represents  a  surplus 
energy  that  alters  the  amplitude  of  the  NOLM  input  pulse.  As  the  pulse  intensity  exceeds  that  for 
maximum  transmission,  the  pulse  will  experience  greater  loss  in  each  round  trip.  If  this  loss  is  not 
too  great,  it  can  recover.  However,  when  the  gain  is  so  great  that  the  transmission  falls  below  the 
recovery  threshold,  then  the  pulse  decays.  We  note  that  the  transmission  has  a  very  deep  minimum 
near  a  pulse  energy  of  4.  The  difference  between  the  minimum  and  maximum  stable  gains  implies 
that  the  pulse  with  an  amplitude  such  that  it  is  near  the  transmission  maximum,  need  only  an 
additional  amplification  by  about  10  %  (to  recover  the  loss  due  to  the  output  coupler).  When  the 
gain  is  below  the  minimum,  there  is  not  enough  replenishment  to  sustain  the  pulse  in  the  cavity 
and  when  the  gain  is  above  the  maximum  the  pulse  amplitude  from  the  amplifier  places  the  pulse 
it  in  a  very  steep  part  of  the  NOLM  transmission  curve.  The  pulse  experiences  too  great  a  loss 
through  the  NOLM  and  will  not  be  able  to  recover  in  the  next  pass  through  the  amplifier. 

The  directional  coupler  for  the  NOLM  loop  produces  two  pulses  of  roughly  half  the  input 
intensity,  traveling  in  opposite  directions.  This  would  make  them  approximately  unit  solitons, 
and  as  such  subject  to  minimum  distortion  during  propagation.  However,  when  these  pulses  are 
recombined  and  propagate  in  the  amplifier,  they  tend  to  be  reformed  into  a  soliton  shape  suitable 
to  their  amplitude,  and  to  shed  some  light  to  dispersive  wave  radiation.  The  degree  of  transmission 
in  the  NOLM  loop  is  highly  dependent  on  pulse  shape,  and  the  reshaped  pulse  will  experience 
greater  loss  than  the  original  sech  shape.  If  this  loss  becomes  too  great,  the  laser  will  simply  cease 
to  function. 

The  steady-state  pulse  shape  for  an  amplifier  length  of  0.1  dispersion  lengths  is  well  fit  in  our 
numerical  simulations  by  a  hyperbolic  secant  shaped  envelope;  in  the  NOLM  this  pulse  is  split 
into  two  pulses  that  are  good  approximations  to  fundamental  solitons.  The  amplifier  is  short,  so 
that  there  is  a  small  amount  of  additional  dispersion  in  the  cavity.  The  fit  of  the  pulse  intensity 
to  a  fundamental  soliton  shape  for  this  case  is  much  narrower  than  the  actual  pulse  width. 

The  spectrum  of  this  pulse  a  central  section  that  is  also  a  hyperbolic-secant  shape,  but  there 
is  additional  structure  in  the  wings  that  is  due  to  dispersive-wave  shedding.  The  shoulders  are 
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consistent  with  the  placement  of  the  sideband  peaks  given  by  the  formula[47] 


um  =  \JSmZ0/L  -  1;  (6) 

where  L  is  the  length  of  the  fiber  and  Zq  is  the  soliton  length  corresponding  to  the  steady-state 
pulse  width. 

The  amplifier,  whose  length  is  0.7  dispersion  lengths,  has  a  central  region  that  is  well  fit  by 
a  soliton  envelope,  but  the  wings,  having  shed  radiation  are  much  larger.  Here  we  emphasize 
again  that  the  soliton  envelope  is  constrained  by  both .  its  amplitude  and  width.  Compaxison 
between  the  short  and  long  amplifier  cases  shows  that  pulse  propagation  over  a  longer  distance 
in  the  amplifier  is  better  approximated  by  a  fundamental  soliton  shape.  There  is  a  much  more 
pronounced  sideband  structure  that  is  amplified  in  the  cavity.  The  sideband  peak  positions  are 
well  approximated  by  the  expression  in  Eq.  (6). 

Contrasting  the  two  cases  of  a  short  and  a  long  amplifier  length,  we  conclude  that  the  longer 
amplifier  reshapes  the  pulse  toward  a  fundamental  soliton.  Therefore,  as  the  pulse  is  split  in  two 
by  insertion  into  the  NOLM  it  undergoes  reshaping  again  toward  a  much  different  fundamental 
soliton  amplitude.  In  other  words,  the  NOLM  and  the  amplifier  shaping  mechanisms  compete 
against  one  another.  This  interpretation  of  our  results  suggests  a  a  strategy  to  improve  the  mode¬ 
locking  stability  of  the  F8L.  We  proposed  to  balance  the  dispersion  in  the  NOLM  and  the  amplifier 
sections  in  order  to  minimize  the  pulse  reshaping  in  each  cavity  [62]. 

3.2  Dispersion  Balanced  Figure-eight  Laser 

In  our  previous  Subsection,  we  modeled  the  behavior  of  a  fiber  laser  mode-locked  by  a  nonlinear 
optical  loop  mirror  (NOLM).  A  schematic  of  the  laser  is  shown  in  Figure  2.  The  loop  mirror  has 
a  length  of  four  soliton  periods.  A  60/40  directional  coupler  was  used  in  the  center,  and  10%  of 
the  pulse  energy  was  coupled  out  of  the  cavity.  We  found  that  the  pulse  experienced  a  loss  which 
was  dependent  on  the  length  of  the  amplifier  and  we  couldn’t  stabilize  the  pulse  output  if  the 
length  of  the  amplifier  was  longer  than  0.83  dispersion  lengths.  This  result  led  us  to  propose  a 
new  strategy  to  improve  the  operational  stability  of  the  F8L. 

Our  analysis  showed  that  this  instability  is  due  to  the  competing  tendencies  of  the  amplifier 
and  the  NOLM  to  reshape  the  pulses  toward  a  fundamental  soliton  shape.  When  two  pulses  in 
the  NOLM  favor  a  fundamental  soliton  shape,  then  maximum  transmission  is  produced  for  a  pulse 
with  a  field  envelope  with  a  shape  2.2sech(r),  where  the  intensity  and  r  are  in  soliton  units.  For 
this  situation  the  pulse  will  be  transmitted  with  only  a  small  amount  of  distortion.  The  two  pulses 
at  the  output  of  the  NOLM  are  recombined  and  are  not  a  fundamental  soliton  in  the  amplifier. 
When  the  fiber  amplifier  is  extended,  the  pulse  is  reshaped  toward  a  fundamental  soliton  and  for 
a  long  amplifier,  it  approaches  the  fundamental  soliton  shape.  We  attribute  the  length  dependent 
loss  to  the  fact  that  this  reshaping  process  resulted  in  a  pulse  shape  which  was  not  correct  for 
complete  transmission  through  the  NOLM. 

This  understanding  of  the  length  dependent  loss  suggests  a  possible  alternate  solution.  We 
should  design  the  amplifier  fiber  so  that  the  pulse  is  a  fundamental  soliton  in  that  fiber,  as  well 
as  in  the  NOLM;  this  balances  the  two  sections  of  the  laser  so  that  the  pulses  are  always  close  to 
a  fundamental  soliton  shape  and  that  perturbations  of  that  shape  are  kept  to  a  minimum.  For  a 
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where  7  is  the  nonlinear  coefficient,  P0  is  the  peak  power,  To  is  the  pulse  width  and  /?2  is  the  dis¬ 
persion.  The  parameter  7  contains  both  the  intrinsic  material  parameters  and  the  fiber  geometry; 
we  assume  that  it  is  constant  in  our  simulations.  Since,  we  desire  that  To  be  the  same  in  both 
sections,  then  the  dispersion  must  be  increased  by  a  factor  of  around  2.2  to  compensate  for  the 
increased  peak  intensity  at  the  output  of  the  NOLM. 


amplifier  length 

Figure  5:  Stability  regime  for  stable  single  pulse  operation  is  a  band  in  the  amplifier  gain  between 
about  1.1  and  1.3. 

This  design  feature  was  incorporated  into  our  laser  model.  Figure  5  shows  the  regions  in 
which  stable  pulses  could  be  formed.  This  figure  is  in  sharp  contrast  to  the  stability  regime  of 
the  ordinary  F8L  configuration  shown  in  Fig.  3.  It  should  be  noted  that  in  this  case  the  limit  of 
the  abscissa  of  the  graph  extends  to  25  dispersion  lengths;  the  pulse  is  very  well  approximated 
by  a  fundamental  soliton  and  for  further  amplifier  lengths,  no  change  is  expected.  It  should  be 
further  noted  that  the  gain  required  for  stable  operation  is  quite  constant,  representing  minimal 
gain  dependent  loss  and  a  result  of  the  soliton  pulse  shape.  When  the  amplifier  is  longer  than 
about  5  dispersion  lengths,  pulse  reshaping  is  minimal,  and  the  maximum  and  minimum  stable 
gains  are  nearly  constant.  If  the  amplifier  is  shorter  than  this,  the  pulses  deviate  from  a  soliton 
shape  and  the  output  from  the  NOLM  is  sensitive  to  its  input  pulse  shape. 

To  obtain  insight  into  the  functioning  of  this  laser,  the  pulse  shape  was  examined  at  the  point 
at  which  it  left  the  output  coupler,  and  before  it  entered  the  directional  coupler;  it  is  shown  in 
Fig.  6.  The  case  that  was  examined  was  for  a  25  dispersion  length  amplifier,  with  a  gain  parameter, 
G  =  0.0056;  This  corresponds  to  an  amplifier  gain  of  eGL  =  1.15.  The  output  pulse  has  a  nearly 
uniform  phase.  The  pulse  intensity  corresponds  almost  exactly  to  that  of  a  fundamental  soliton, 
with  a  pulse  intensity  of  about  2.7.  In  comparing  the  relationship  of  the  pulse  width  to  pulse 
height  it  must  be  remembered  that  the  ratio  has  been  altered  by  a  factor  of  2.2. 

Figure  7  shows  the  pulse  in  frequency  space.  Sidebands  have  clearly  developed.  The  separation 
of  the  first  order  sideband  from  the  center  is  0.05  in  normalized  frequency  units,  which  is  consistent 
with  dispersive- wave  shedding[53].  In  the  case  of  a  F8L,  the  soliton  period  varies  as  the  pulse 


10-14 


a 

o» 

« 


time 


Figure  6:  Pulse  shape  and  the  phase  of  the  pulse  after  propagating  through  a  25  dispersion  length 
amplifier.  The  minimum  amplifier  gain  of  1.1  was  used  and  the  pulse  is  seen  after  the  output 
coupler. 


Figure  7:  The  pulse  spectrum  of  the  pulse  in  Fig.  3.  The  side  bands  are  reduced  in  this  balanced 
laser;  their  position  is  consistent  with  soliton  perturbation  theory,  see  Refs.  4,6. 

travels  through  the  laser.  Since  the  length  of  the  amplifier  is  25  dispersion  lengths,  this  section 
alone  has  a  length  far  longer  than  eight  soliton  periods  usually  found  to  limit  operation  of  the  F8L; 
a  restriction  that  also  applied  to  our  previous  F8L  simulation.  This  is  significant  as  the  length 
appears  to  represent  a  length  limit  for  ring  lasers  and  we  attribute  this  to  the  improved  rejection 
of  the  dispersive  wave  in  our  NOLM  because  the  pulses  are  very  nearly  fundamental  solitons.  We 
also  find  that  the  dispersive  wave  component  in  the  spectrum  is  increased  when  the  cross-coupler 
splitting  ratio  is  closer  to  50/50  and  the  length  of  the  NOLM  is  correspondingly  increased. 

The  existence  of  a  maximum  amplifier  length  sets  a  limit  of  the  shortest  pulse  that  can  be 
produced  by  a  given  laser.  If  the  physical  length  of  the  amplifier  is  given  by  «,  and  the  maximum 
amplifier  length  in  dispersion  lengths  is  Lmax  the  limit  would  be  expressed  as 


^  —  LmaxLo-. 


(8) 
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Since  the  dispersion  length  can  be  defined  as  Ld  =  T02/ 1/?2|,  Eq.  (8)  can  be  transformed  to 


The  limit  on  Lmax  is  related  to  the  dispersive  wave  soliton  resonance  of  8  soliton  periods  [60], 
but  since  this  laser  has  reduced  side  band  amplitudes,  this  limit  no  longer  applies.  Our  results 
indicate,  however,  that  it  is  possible  to  decrease  the  shortest  possible  pulse,  while  increasing  the 
average  dispersion  in  the  laser,  which  is  a  possibility  which  has  not  been  explored  before.  Previous 
work  has  centered  on  optimizing  the  performance  of  the  laser  by  using  fiber  with  normal  dispersion 
[60,  32].  It  should  be  noted  that  our  method  for  optimizing  the  laser  could  be  implemented  by 
decreasing  the  dispersion  of  the  fiber  in  the  NOLM  loop. 

3.3  Conclusions 

The  final  pulse  widths  depend  on  several  factors  including,  the  amplifier  gain  and  length,  and  the 
length  of  the  NOLM.  In  Figures  5  and  6,  the  full  width  at  half  maximum  corresponds  to  about 
2.5  and  1.5  in  scaled  units,  resp..  In  physical  units  the  pulses  have  widths  of  about  750  fs  and  450 
fs.  The  higher  gain  parameter  has  a  shorter  width  due  to  stronger  nonlinear  shaping  mechanisms. 
We  can  also  work  toward  shorter  pulses  by  adjusting  the  cavity  length.  Naturally,  as  the  pulses 
shorten  the  propagation  loss  effects  mentioned  in  Section  3  must  incorporated  into  the  analysis 
and  the  amplifier  model  should  incorporate  a  more  accurate  gain  profile  shape  and  the  dynamical 
evolution  of  the  active  medium. 

In  conclusion,  we  have  developed  a  simulation  that  predicts  steady-state  properties  of  the 
NOLM  fiber  laser.  We  have  found  in  our  results  a  length  dependent  minimum  pulse  width, 
a  length  dependent  loss,  which  has  been  observed  by  experimental  groups,  but  has  not  been 
reported  in  previous  simulations.  All  prior  F8L  simulations  do  not  incorporate  propagation  in 
the  amplifier.  We  have  also  found  the  existence  of  a  maximum  application  for  stable  single-pulse 
operation  and  observed  how  the  final  pulse  shape  is  dependent  on  the  length  of  the  laser  cavity, 
including  propagation  in  the  gain  medium.  Finally,  we  have  observed  dispersive  wave  shedding  in 
the  cavity,  especially  for  larger  gains.  The  side  bands  in  the  power  spectrum  are  consistent  with 
the  results  discussed  by  Dennis  et  al.  [56].  The  maximum  gain  instability  occurs  for  an  effective 
cavity  length  around  7-8  soliton  lengths. 

The  final  pulse  widths  depend  on  several  factors  including,  the  amplifier  gain  and  length,  and 
the  length  of  the  NOLM.  In  physical  units  the  pulses  have  widths  of  about  750  fs  for  the  shorter 
amplifier  and  about  450  fs  for  the  longer  amplifier.  The  higher  gain  parameter  has  a  shorter 
width  due  to  stronger  nonlinear  shaping  mechanisms.  We  can  also  work  toward  shorter  pulses  by 
adjusting  the  cavity  length.  Naturally,  as  the  pulses  shorten  the  propagation  loss  effects,  such  as 
stimulated  Raman  scattering,  must  incorporated  into  the  analysis  and  the  amplifier  model  should 
incorporate  a  more  accurate  gain  profile  shape  and  the  dynamical  evolution  of  the  active  medium. 

4  Summary  of  Research 

The  dispersion-balanced  F8L  is  an  excellent  candidate  for  future  experimental  work.  Based  on 
our  numerical  simulation,  when  the  fiber  dispersion  parameters  are  matched,  a  greatly  improved 
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operational  stability  of  the  lasers  is  found.  The  pulse  shape  is  more  closely  approximated  by  a 
fundamental  soliton  shape,  i.e.  there  is  a  better  correspondence  of  the  pulse  height  and  width  in 
both  time  and  frequency  domains  to  that  of  a  fundamental  soliton. 

During  the  Summer  Faculty  Program  two  papers  were  prepared  for  publication  and  submitted 
to  technical  journals.  These  papers  cover  steady-state  operation  of  F8Ls.  The  novel  feature  we 
propose  for  future  F8Ls  is  to  apply  our  design  feature  called  dispersion  balancing,  which  simply 
places  a  fiber  with  approximately  double  the  dispersion  of  the  NOLM  in  the  amplifier  section 
of  the  laser.  This  prevents  break-up  of  the  pulses  and  stable  operation  because  both  sections 
propagate  pulses  that  are  nearly  fundamental  soliton  shapes. 

Research  was  also  begun  on  a  new  passive  mode-locking  element  called  a  fiber  Bragg  reflector. 
This  element  takes  advantage  of  the  nonlinear  operation  of  a  fiber  grating,  when  a  pulse  is  tuned 
inside  the  stop  band.  The  stop  band  has  low  transmission  when  operating  in  the  linear  regime, 
but  at  sufficiently  high  intensity,  the  pulse  is  transmitted.  An  expanded  program  of  research  on 
this  new  mode-locking  element  is.  currently  being  pursued.  There  is  also  experimental  support  for 
the  concept  at  the  Photonics  Laboratory  in  Rome,  NY  and  the  details  will  be  worked  out  during 
the  coming  year. 
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ABSTRACT. 


We  have  fabricated  and  measured  the  polarization  dependence  of  the  transmission  spectrum 
of  Metal  Strip  Polarizing  Fibers.  The  measurements  were  performed  at  the  Photonics  Center  of 
Rome  Laboratories.  These  devices  were  fabricated  by  Syracuse  University.  We  have  shown  that 
the  metals  survive  the  fabrication  process.  We  where  able  to  measure  the  plasma  resonance 
absorption  of  the  metal  strips. 
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METAL  STRIP  POLARIZING  FIBERS 

Philipp  Kornreich 


INTRODUCTION 


We  have  tested  the  transmission  spectra  and  polarizing  properties  ot  Metal  Strip  Polarizing 
Fibers  (MSPFs)  that  were  fabricated  at  Syracuse  University.  The  MSPFs  consist  of  a  glass  core 
flanked  on  two  sides  by  thin  by  a  thin  semiconductor  cylinder.  That  is  the  metal  strips  are  lo¬ 
cated  at  the  core  cladding  interface,  see  Fig.  1.  Light  polarized  parallel  to  the  metal  strips  is 
absorbed  while  light  polarized  perpendicular  to  the  strips  is  not  absorbed.  We  have  used  a  va¬ 
riety  of  materials  to  fabricate  these  strips.  We  have  used  silver,  copper  and  CdTe.  A  number  of 
eacn  type  of  fiber  were  fabricated  by  Syracuse  University  and  tested  at  the  Photonics  Center  of 
Rome  Laboratories.  Spectral  measurements  at  the  Photonics  Center  of  Rome  Laboratories 
showed  that  these  materials  survived  the  fiber  fabricating  process.  The  fibers  had  reproducible 
characteristics. 


Fig.  1.  Typical  MSPF  construction.  The  device  consists  of  a  glass  core  with  two 
thin  metal  strips  at  the  core  cladding  interface. 

1  p 

Several  type  of  fiber  polarizers  have  been  constructed  by  other  researcher  ’  .  A  particular 

successful  type  was  constructed  by  grinding  away  some  of  the  cladding  of  in  glass  are  a  prede- 
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cessor  of  this  technology.  SMs  have  a  large  number  of  surface  states  at  their  interfaces  with  the 
host  giass.  A  metal  block  was  placed  in  the  space  where  the  cladding  was  ground  away,  see  Fig.  2. 
Extinction  ratios  of  70  dB  have  been  reported  for  these  devices. 


Fig.  2.  D  shaped  polarizing  fiber  with  metal  block. 


Based  on  our  successful  experience  of  fabricating  Semiconductor  Cylinder  Fibers  (SCFs) 

which  we  have  reported  last  summer  we  attempted  to  use  the  same  techniques  for  fabricating 
MSPFs. 

The  3M3,4  company  sells  commercial  polarizing  fibers.  However  each  of  these  is  about  40 
m  long  while  the  MSPF  are  oniy  e  few  mm  long. 

MSPF  FABRICATION 

i  he  MSPFs  were  fabricated  by,  first,  vacuum  depositing  metal  film  strips  on  two  sides  of  a 
Pyrex  glass  rod  that  will  form  the  core  of  the  fiber  in  a  diffusion  pumped  vacuum  system.  The 
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glass  rod  is  clamped  in  a  mask  as  shown  in  Fig.  3  for  this  deposition  process.  A  Pyrex  glass  was 
selected  that  has  a  softening  temperature  of  720  °C.  This  is  lower  than  the  melting  points  of  the 
metals  used. 


-  Glass  rod 

Fig.  3.  Mask  for  vacuum  depositing  metal  strips  on  core  glass  rod. 


Next,  the  metal  coated  glass  rod  was  inserted  into  a  Pyrex  glass  tube  that  has  been  closed  at 
one  end.  This  structure  is  evacuated  and  collapsed.  N2  is  used  as  the  residual  gas  in  the  evacua¬ 
tion  process.  Finally,  a  fiber  is  pulled  from  the  resulting  preform. 

EXPERIMENTAL  RESULTS 


In  order  for  this  device  to  be  practical  it  is  necessary  that  only  a  single  mode  propagate  in 
the  fiber.  This  requires  that  the  core  have  a  sufficiently  small  diameter  and  that  the  core  have  a 
slightly  higher  index  of  refraction  than  the  cladding.  Unfortunately  we  were  not  able  to  secure 
glass  rod  that  have  a  sufficiently  small  diameter  to  yield  sufficiently  small  fiber  cores,  and 
worth,  the  core  and  cladding  glasses  had  the  same  index  of  refraction.  Recall,  the  Semiconductor 
Cylinder  Fibers  (SCF)  that  we  tested  last  year  had  a  similar  problem.  However,  since  the 
semiconductors  completely  enclosed  the  core  it  facilitated  some  guiding.  This  is  not  the  case  with 
the  metal  strips. 

Several  metal  strip  fibers  were  fabricated  and  tested.  The  metal  films  deposited  on  the  glass 
rods  where  approximately  3000  A  thick.  Since  all  components  shrink  proportionally  in  the 
fiber  fabricating  process  by  a  factor  of  about  94  the  metal  films  should  have  a  thickness  of 
about  32  A.  Not  even  a  SEM  could  resolve  these  films.  Therefore,  we  are  having  these  films 
analyzed  by  a  Scanning  Force  Microscope  at  Rome  Laboratories.  To  date  we  are  still  waiting  for 
these  tests  to  be  performed. 

We  here  present  the  test  results  of  Ag  strip  fibers.  These  fibers  had  approximately  35  pm 
diameter  Pyrex  glass  cores  and  80  pm  diameter  Pyrex  glass  claddings.  Fibers  of  various  length 
ranging  from  120  mm  to  12  mm  length  were  tested.  The  12  mm  long  devices  worked  best.  The 
results  presented  here  were  for  the  12  mm  long  fibers.  As  stated  above  material  availability 
restricted  us  to  use  glass  with  the  same  index  of  refraction  for  both  the  core  and  cladding. 
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The  following  test  arrangement  was  used:  Light  from  a  white  light  source  was  focused  by  a 
microscope  objective  into  the  fiber.  Another  microscope  objective  was  used  to  retrieve  the  light 
from  the  fiber.  An  iris  was  used  to  block  the  light  from  the  cladding.  These  fibers  had  exceed¬ 
ingly  poor  guiding.  The  light  passed,  next,  through  a  Glen  Thompson  Polarizing  Prism.  Finally 
the  light  was  focused  into  a  ANDO  type  AQ  1425  optical  spectrum  analyzer  connected  to  a  com¬ 
puter.  “Lab.  Window”  software  was  used  to  control  the  spectrum  analyzer  and  record  the  data. 

The  data  taking  procedure  was  as  follows:  It  is  important  to  note  that  the  spectrum  analyzed 
is  somewhat  polarization  dependent.  A  piece  of  MSP  fiber  was  positioned  in  the  fiber  holder. 
Transmission  date  using  the  spectrum  analyzer  was  recorded  for  consecutive  angular  positions 
of  the  Glen  Thompson  Polarizer.  The  data  was  taken  in  10°  steps.  The  Glen  Thompson  Polarizer 
was  rotated  a  total  of  180°.  This  experiment  for  exactly  the  same  Glen  Thompson  Polarizer  po¬ 
sitions  was  repeated  for  a  piece  of  standard  multimode  fiber  which  was  used  as  a  reference.  The 
reason  for  this  procedure  is  that  the  fiber  could  never  be  repositioned  with  the  same  angular 
position  in  the  fiber  holder.  Of  course,  it  is  not  possible  to  rotate  the  fiber  and  maintain  optical 
alignment.  The  data  from  the  multi  mode  fibers  was  used  to  normalize  the  date  from  the  MSPFs. 


Fig.  4  Normalized  transmission  spectrum  of  MSP  fiber  for  Glen  Thompson  po¬ 
larizer  position  of  78°.  This  corresponds  to  the  maximum  transmission  through 
the  fiber.  The  “bulge”  at  about  1200  nm  is  the  most  significant  part  of  the  data. 
Note  the  OH  absorption  at  about  1400  nm  which  is  larger  in  the  Pyrex  fiber  than 
in  the  commercial  fiber. 
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Fig.  5.  Normalized  transmission  spectrum  of  MSP  fiber  for  Glen  Thompson 
polarizer  position  of  168°.  This  corresponds  to  the  minimum  transmission 
through  the  fiber.  Again, note  the  missing  of  the  “bulge”  at  about  1200  nm.  Note 
the  OH  absorption  at  about  1400  nm  which  is  larger  in  the  Pyrex  fiber  than  in 
the  commercial  fiber. 

fibers.  This  limited  the  effect  of  the  polarization  dependence  of  the  optical  spectrum  analyzer. 
The  polarization  dependence  of  transmission  spectra  of  Ag  strip  as  well  as  of  CdTe  strip  fiber 
sections  were  obtained. 

The  normalized  transmission  spectrum  of  an  Ag  MSP  fiber  at  Glen  Thompson  Polarizer  po¬ 
sitions  of  78°  and  168°,  corresponding  to  minima  and  maxima  of  the  transmission  through  the 
MSP  fiber  are  shown  in  Fig’s  4  and  5.  Of  course,  we  have  plotted  data  of  these  fibers  in  10° 
polarization  steps. 

The  logarithm  of  the  ratio  of  maximum  and  minimum  normalized  transmission  data,  the  data 
presented  in  Fig’s.  4  and  5  is  shown  in  Fig.  6.  Note  the  “large”  peak  at  about  1230  nm.  The 
normalization  with  respect  to  the  muitimode  fiber  should  eliminate  effects  of  the  polarization 
dependence  of  the  optical  spectrum  analyzer.  Note  from  Fig.  6  that  the  OH  absorption  was 
eliminated  in  this  process.  In  order  to  make  sure  that  the  resonance  peak  in  Fig.  6  is  not  due  to 
the  polarization  dependence  of  the  spectrum  analyzer  we  analyzed  the  ratio  of  other  normalized 
transmission  spectra.  For  example,  the  logarithm  of  the  ratio  of  the  normalized  transmission 
spectrum  of  date  with  the  Glen  Thompson  Polarizer  positioned  at  118°  and  208°  is  shown  in  Fig. 
7.  No  resonance  peak  appears  in  this  data.  Of  course,  data  with  the  Glen  Thompson  Polarizer 

adjusted  to  angles  close  to  1 68°  and  78°  exhibit  smaller  resonance  peaks. 
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Fig.  6.  Ratio  of  normalized  transmission  spectrum  of  MSP  fiber  for  Glen 
Thompson  polarizer  positions  of  1 68°  and  78°.  Note  the  plasma  resonance  peak  at 
1230  nm. 


Wavelength  in  nm 


Fig.  7.  Ratio  of  normalized  transmission  spectrum  of  MSP  fiber  for  Glen 
Thompson  polarizer  positions  of  208°  and  118°.  Note  the  absence  of  the  plasma 
resonance  peak  at  1230  nm. 
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Metals  exhibit  plasma  resonances  at  optical  frequencies.  We  believe  that  the  peak  observed 
in  Fig.  6  is  a  plasma  resonance  of  the  Ag  film. 

Of  course,  the  present  fibers  with  their  large  non  guiding  cores  are  not  practically  useful. 
Nevertheless,  we  were  able  to  obtain  some  interesting  data  for  the  Ag  strip  fibers. 

Syracuse  University  has  glass  on  order  that  will  allow  the  fabrication  of  single  mode  MSP 
fibers.  To  make  these  fiber  polarizers  commercially  viable  it  is  necessary  to  find  metals  that 
have  plasma  resonances  near  wavelength  of  1.3  nm  and  1.55  nm. 
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Abstract 


This  report  discusses  a  framework  for  visualization  in  imagery  exploitation  applications.  We  evaluate 
a  recently -developed  tool  ( called  VIA  (Visualization  for  Image  Analysts))  whose  goal  was  to  aid  image 
analysts  through  the  use  of  visualization  techniques.  The  strengths  and  shortcomings  of  VIA  are 
discussed  and  suggestions  for  improvements  and  new  directions  for  this  tool  are  made.  More  genetically, 
we  also  propose  a  re-casting  of  the  usage  of  visualization  techniques  within  the  imagery  exploitation 
arena,  especially  for  battle  damage  assessment  (BDA),  mission  rehearsal,  and  training  regimens  for 
image  analysts.  We  also  present  our  new  framework  for  an  improved  graphical  user  interface  (GUI)  for 
the  VIA  tool.  The  new  framework  conforms  to  current  directions  and  preferred  practices  in  graphical 
user  interface  design. 
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1  Introduction 


Visualization  involves  the  use  of  computer-synthesized  images  or  pictures  to  discover  or  ***** 

geometry  and  topology  (that  is,  the  relationships  between  the  data)  [2,  9].  Visualization  also  often 
involves  the  hi-resolution  animation  of  time-varying  data;  in  fact  the  original  understanding  and  definition 
of  visualization  (from  a  1987  National  Science  Foundation  Report)  considered  this  to  be  the  hallmark 
of  visualization  (8].  Today,  visualization  also  often  involves  the  use  of  cross-disciplinary  techniques, 
specifically  the  use  of  sophisticated  scientific  tools  from  other  disciplines  [9].  Topics  of  current  investigation 
within  the  visualization  communities  include  extraction  and  display  of  useful  features  from  large  datasets; 
using  imaging  representations  and  interactive  graphics  in  manipulation  and  rendering,  including  volumetric 
modeling  and  rendering,  such  as  viewing  the  intenor  of  volumes;  studying  human  peroepuon  of  information; 
multi-media  presentation  of  information,  the  portrayal  of  error  and  uncertainty  in  data;  and  "augmented 
reality”  (also  called  virtual  reality). 

In  this  final  report  of  our  1995  Summer  Faculty  Research  Program  experience,  we  present  our  wort 
on  the  development  of  a  framework  for  the  usage  of  visuahzauon  techniques  in  imagery  exploitation 
applicauons.  parucularly  in  image  analysis.  In  Secuon  2  olothe  report,  background  material  on  the  usage  oi 
visualization  techniques  for  cartographic  and  image  expioitauon  purposes  is  presented.  In  Section  3,  we 
evaluate  the  (Gmmman-developed)  Visualizauon  for  Image  Analysis  (VIA)  tool's  aims  and  goals  and  the 
capability  of  the  tool  to  meet  those  goals.  The  fourth  secuon  of  the  report  discusses  the  framework  that 
we  are  developing  to  improve  the  user  interface  ( UI)  of  the  VIA  tool  and  presents  our  progress  to  date  or 
this  effort  We  conclude  the  report  with  a  discussion  of  potential  future  directions  for  visualization  in  the 
imagery  expioitauon  arena. 

An  addendum  to  the  report  brielly  discusses  a  second  area  of  invesugauon  for  our  summer  study,  the 
dual-use  Stausucal  Multiple  Object  Detecuon  and  Locauon  System  (S-MODALS)  aruricial  neural  network 
(ANN)  technology.  In  the  addendum,  we  discuss  our  attempts  to  integrate  the  technology  into  the  IE200C 
environment. 
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2  Visualization  for  Image  Exploitation 


The  presentation  of  cartographic  data  has  been  studied  extensiveiy  over  many  years  and  has  produce 
a  number  of  heuristic  guidelines  for  mapping.  For  example,  it  is  understood  that  color  must  be  uset 
carefully  and  appropriately  to  highlight  relationships  and  patterns  within  map  data.  Many  of  the  mappinj 
guidelines  are  not  formal  rules  but  rather  rely  on  the  experience  of  the  mapmaker.  Currently,  there  is  interei 
within  cartography  in  the  promotion  of  human-map  interaction  through  computer-aided  visualization  o 
map  data.  Some  of  the  topics  being  explored  in  cartographic  visualization  include  the  depiction  of  chang 
and  movement,  the  use  of  multimedia,  and  the  presentation  of  multiple  views  of  data  to  aid  in  hrnnai 
understanding  [5],  Many  of  these  investigations  currently  focus  on  how  the  power  of  interactive  compad*] 
can  be  used  to  aid  user  exploration  of  cartographic  data 

Some  of  the  experiences  and  techniques  from  cartographic  visualization  may  be  applicable  in  task 
within  image  exploitation  that  would  benefit  from  visualization  capabilities.  Techniques  for  presentinj 
cartographic  information  on  a  computer  monitor  may,  however,  differ  from  many  of  the  techniques  useftd.ii 
visualization  of  cartographic  or  image  data  on  paper.  For  example,  the  size  of  a  computer  monitor  is  limitei 
whereas  paper  maps  large  enough  to  cover  an  entire  wall  can  be  created.  Furthermore,  the  resolution  c 
most  computer  monitors  is  generally  limited  to  no  larger  than  approximately  1000  x  1000  screen  element 
(i.e.,  no  more  than  100-125  pixels  per  inch).  Thus,  information  must  be  carefully  presented  on  compute 
monitors. 

The  purpose  of  the  project  that  produced  the  VIA  tool  was  to  explore  the  feasibility  of  using  visualizatioi 
techniques  in  an  image  exploitation  task.  Specifically,  the  goal  was  to  aid  image  analysts.  In  imagi 
exploitation,  two  of  the  interests  are  <I)  searching  to  find  targets  and  other  features  of  interest  and  (2 
detecting  subtle  changes  to  identify  clandestine  operauons.  Targets  usually  occupy  very  small  region 
in  typical  imagery  products  and  furthermore  are  someumes  oriented  at  a  bad  angle  or  camouflaged  witJ 
netting.  There  also  tends  to  be  minimal  changes  between  images  in  a  time  series,  making  detection  o 
change  very  difficult.  The  presence  of  shadows  in  an  image  can  also  complicate  an  image  analyst’s  tasi 
By  providing  visualization  cues  to  the  anaiyst.  his  or  her  attention  could  be  focused  into  areas  of  the  imagi 
more  likely  to  contain  targets. 

VIA  incorporated  three  visualization  techniques  including  perspecuve  viewing  of  aerial  or  satellit 
imagery  (such  as  Landsat.  SPOT,  ADRI,  etc.),  overlaying  feature  information  on  images,  and  dispiayin 
3D  target  keys.  Grumman  hoped  that  the  perspective  viewing  might  also  help  a  pilot  in  navigation  o 
target  acquisition,  in  addiuon  to  helping  image  analysts  in  the  creation  of  targetting  products.  Research  ii 
perception  reveals  that  perspective  and  shadowing  are  very  useful  in  human  determination  of  position  ani 
orientation  (3J. 
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3  An  Evaluation  of  VIA 


In  this  section,  we  evaluate  the  VIA  tool  and  propose  a  number  of  potential  directions  for  future 
enhancements.  One  of  the  strengths  of  VIA  is  that  it  provides  a  set  of  visual  cues  and  helps  for  image 
analysis  tasks.  Hie  package’s  visualization  capabilities  are  helpful,  although  they  are  also  somewhat  limited 
and  tend  to  be  somewhat  cumbersome  to  use,  particularly  for  novice  users.  The  package  could  be  made 
more  useful  with  a  number  of  extensions,  including  a  more  intuitive  graphical  user  interface. 

3.1  VIA  Strengths 

VIA  has  a  number  of  beneficial  features.  One  strength  is  that  it  supports  the  overlay  of  surface  material 
features.  The  overlay  capability  is  useful  in  locating  certain  features  in  a  sparse  environment  (for  example, 
roads  in  desen  areas).  The  overlay  feature  also  assists  in  the  display  data  that  will  aid  understanding  and 
hasten  user  decision-making,  for  example  by  focusing  analyst  attention  on  areas  likely  to  contain  targets) 
(through  the  display  of  the  surface  material  features). 

VIA  also  supports  perspective  viewing  and  zooming  from  any  viewpoint.  Viewing  from  different  and, 
occassionaily,  unusual  vantage  points  may  assist  users  in  finding  things  which  might  not  have  otherwise 
been  found.  Some  have  observed  that  users  tend  to  take  too  much  time  to  make  a  decision  about  image 
content  when  using  the  tool,  however.  This  behavior  may  be  caused  by  the  availability  of  interactive 
exploration  capabilities,  such  as  perspective  viewing  and  zooming.  These  features  give  the  user  more 
freedom  and  capabilities  which  he  or  she  may  wish  to  exploit  fully  before  making  a  decision  on  image 
content 

VIA  also  allows  the  user  to  synthesize  images  from  CAD  models  (created  with  the  Army's  Ballistic 
Research  Laboratory  CAD  package  (BRL-CAD).  The  illumination  parameters  for  these  synthetic  target 
images  can  be  adjusted  for  sun  position.  Shadows  cast  by  these  objects  can  also  be  rendered  in  the  image, 
although  sun  position  does  not  affect  the  rendering  of  the  (non-synthetic)  imagery.  The  synthetic  images 
can  also  include  opaque  or  semi-translucent  camouflaging  through  simulation  of  the  appearance  of  netting. 
The  user  is  able  to  observe  these  synthetic  images  and.  presumably,  compare  them  with  the  image  that  is  to 
be  analyzed. 

3.2  VIA  Limitations 

Products  that  include  graphical  user  interfaces  generally  must  follow  one  of  two  philosophies  in  their 
human-computer  interaction.  Specifically,  the  interaction  must  be  either  very  similar  to  the  interaction 
model  of  other  products  which  provide  generally  similar  functionality  or  the  interaction  must  be  completely 
different  from  other  products  that  provide  generally  similar  functionality  [13].  Products  with  dissimilar 
interfaces  tend  to  be  more  difficult  for  new  users  to  grasp,  thus  it  is  generally  held  that  it  is  advantageous 
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for  graphical  user  interfaces  to  be  similar,  that  is.  to  follow  a  standard  [12.  13].  The  VIA  product  conta 
several  features  that  do  not  conform  to  generally  accepted  Motif  standards  and  also  has  several  defidenc 
in  its  GUI. 

One  of  VIA’s  weaknesses  is  that  it  is  not  strongly  compliant  with  the  Motif  Style  Guide.  Its  usage 
File  and  “Quit”  buttons  on  its  main  image  canvas  is  not  strictly  compliant  with  Motif  standards  a 
also  differs  from  general  practice  in  graphical  user  interfaces.  In  other  GUIs,  these  commands  are  m 
typically  invoked  from  a  menu  bar  at  the  top  of  the  window  or  canvas  that  contains  the  image  (or  dnnmw 
or  drawing,  depending  on  the  application).  Further,  in  VIA  these  buttons  are  physically  grouped  with  od 
commands  that  have  different  logical  purposes.  The  other  buttons  that  File  and  Quit  are  grouped  withal 
have  different  behaviors.  Ia  GUIs,  physically  adjacent  buttons  should  follow  similar  steps  in  accomptahi 
their  tasks  and  should  perform  topically  similar  tasks 1 .  The  other  buttons  next  to  “File”  and  “Quit”  alien 
pop-up  windows  to  appear  on  the  screen.  The  pop-up  windows  control  the  illumination,  viewpoint,  overt 
parameters,  etc.,  of  the  main  display  window.  Since  the  “File”  and  “Quit”  buttons  behave  differently  (tfc 
do  not  bring  up  any  windows  or  menus  or  cause  any  dialog  box  to  be  produced,  and  their  behavior 
somewhat  transparent  to  the  user2),  they  should  be  physically  separated  (i.e.,  distant)  from  the  viewpo 
ojsd  illumination  control  buttons.  The  “File”  and  "Quit”  buttons  also  would  be  improved  if  they 
some  type  of  dialog  box  to  appear,  even  if  the  dialog  box  contained  merely  a  description  of  what  action  t 
button  was  causing  or  about  to  cause.  The  “File”  button  actually  causes  the  on-screen  image  to  be  saved 
a  file  while  the  “Quit”  button  causes  VIA  to  exit.  A  superior  usage  of  dialog  boxes  would  provide  the  tu 
with  a  cancel  and  confirm  option  before  executing  these  actions. 

The  current  interface  allows  the  viewpoint  to  be  changed  by  moving  six  sliders  which  govern  the  positi 
and  orientauon  of  the  plane  onto  which  the  image  is  projected  for  perspective  viewing.  This  interact! 
is  somewhat  cumbersome  as  it  is  difficult  to  position  the  view  plane  at  a  desired  pose.  It  is  also  diffia 
to  precisely  position  the  sliders:  if  the  user  clicks  on  the  slider  just  one  or  two  pixels  to  the  left  or  rig 
of  where  he  should  click  to  make  a  fine  adjustment,  the  slider  will  move  a  large  amount.  This  could 
improved  by  using  sliders  where  the  “micro”  and  "macro”  adjustment  areas  are  larger  and  more  cleat 
Remarked.  An  even  more  powerful  interface  modality  is  that  of  the  direct  manipulation  model,  perha 
the  most  powerful  mode  of  human-computer  interaction  [13].  For  example,  if  the  user  could  click  t 
mouse  on  the  view  plane  and  drag  the  plane  to  new  positions  and  orientations,  a  more  direct  manipulate 
muds  could  be  achieved.  For  newer  users  and  for  users  desiring  to  interactively  explore  imagery,  a  dirt 
manipulation  interactivity  mode  may  be  quite  useful.  A  more  natural  point-and-click  direct  manipulate 
interface  for  zooming  would  also  be  useful. 

For  example,  in  an  editing  or  word  processing  application,  all  “clipboard”  operations  -  (cut.  copy,  paste,  etc.)  are  group 
physically  close  together.  These  operations  have  similar  logical  purposes  and  behave  similarly  ( i.e..  the  user  uses  similar  steps 
executing  these  operations  and  the  machine  appears  to  carry  out  execution  of  the  operations  in  a  similar  and  consistent  manner). 

Thsr  Quit  button  does  cause  the  application  to  exit,  which  could  be  considered  to  be  non-transparent. 
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la  VIA’s  pop-up  windows  (triggered  by  the  “View”,  "Scene”,  and  “Options”  buttons),  there  are  buttons 
labelled  “Go”  and  “Quit.”  The  “Go”  button  causes  the  settings  in  the  pop-up  window  to  be  applied  as  new 
viewing  or  lighting  parameters.  In  many  GUIs,  this  functionality  might  be  achieved  with  a  button  labelled 
“Apply.’  The  “Quit”  button  actually  causes  the  pop-up  window  to  be  dismissed:  it  does  not  cause  the  VIA 
application  to  exit  In  many  GUIs,  this  type  of  functionality  is  provided  by  buttons  labelled  “Dismiss”  or 
“Cancel.” 

VIA  gives  the  user  some  control  over  illumination  of  the  synthetically  generated  target  images  through 
sliders  that  govern  the  elevation  and  azimuth  of  the  sun  in  the  sky.  This  allows  precise  control  over  all 
possible  sun  positions3,  however  this  interface  may  not  seem  as  natural  to  some  users  as  an  interfac&thtt 
allowed  the  user  to  choose  a  time  of  day  and  a  time  of  year  and  to  see  how  the  sun’s  position  at  tinttime 
affected  target  appearance.  If  the  interface  offered  two  methods  of  controlling  the  sun’s  position^  users 
could  set  illumination  parameters  using  whichever  technique  is  most  straightforward  for  them. 

The  current  VIA  tool  allows  only  a  small  number  of  targets  to  be  synthesized,  it  also  does  not  allow  the 
user  to  directly  load  new  target  models.  It  would  be  useful  if  a  capability  to  load  new  models  was  added  to 
the  VIA  tool.  When  images  of  target  models  are  synthesized,  the  VIA  tool’s  prior  zooming  and  projection 
plane  parameters  also  seem  to  change.  Usually,  the  target  is  synthesized  at  a  much  higher  resolution  than 
the  current  image  setting.  This  makes  it  somewhat  difficult  to  compare  the  synthetic  target  image  with 
regions  of  the  real  image  that  may  contain  the  target.  Finally,  the  targets  can  only  seem  to  be  synthesized  in 
a  small  number  of  orientations.  It  would  be  more  useful  if  the  targets  could  be  synthesized  in  any  positior 
and  orientation  and  in  different  environments  (e.g.,  a  target  partially  obscured  by  trees  or  brush). 

Visualization  products  that  allow  the  user  to  navigate  through  an  image  require  innovauve  ways  to  show 
current  local  position  in  a  lower-resolution  global  map.  Without  features  like  this,  users  tend  to  become 
“lost”  in  an  image.  VIA's  current  display  of  the  viewing  plane  overlaid  on  the  global  image  helps  the 
user  somewhat  in  navigaung  through  the  image,  but  the  projected  image's  direcuon  that  corresponds  to  the 
global  image’s  “up”  direcuon  is  not  always  apparent  Moreover,  as  the  viewing  plane  is  “zoomed”  inti 
the  image,  the  viewing  piane  grows  smaller  and  more  faint  in  the  image;  it  is  difficult  to  distinguish  the 
boundary  of  smaller  viewing  planes  from  image  structures.  This  problem  might  be  alleviated  by  displaying 
the  viewing  plane  in  a  different  color,  such  as  red.  or  by  displaying  the  boundary  with  wider,  bolder  lines 
(Currently  the  view  plane  is  an  off-white  or  beige  color  that  blends  in  with  the  image.) 

Data  should  also  be  presented  in  a  muldpie-window,  hierarchical  fashion  to  aid  user  understanding  of  tin 
images.  It  is  important  in  cartographic  and  image  visualization  products  to  avoid  a  “single  map  mentality,’ 
which  tends  to  be  a  limiting  mindset  for  understanding  the  data  [6].  One  way  to  organize  and  presen 
the  data  is  through  the  use  of  a  multi-resolution  pyramid.  This  may  allow  quicker  presentation  of  dab 

5VIA  also  allows  the  user  to  illuminate  the  targets  using  some  sun  positions  that  are  impossible,  such  as  a  sun  position  on  tht 
northern  horizon. 
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at  a  high  levei,  helping  the  user  locate  his/her  position  in  image  while  supporting  exploration  of  the  data 
at  a  low  level.  Such  an  approach  also  might  use  screen  real  estate  more  effectively.  As  has  been  noted 
previously,  on-screen  images  require  a  new  philosophy  for  data  presentation;  one  cannot  merely  utilize  a 
brute  force  conversion  of  a  map  into  pixels,  but  rather  information  must  be  presented  carefully  on  screen. 
It  is  important  to  not  overwhelm  the  user  and  also  to  not  under-utilize  the  screen  or  try  to  force  too  much 
information  onto  it  It  is  necessary  for  the  presentation  to  be  less  map-oriented  and  more  video-oriented. 

VIA  also  does  not  allow  the  user  to  change  image  contrast  settings  during  execution.  Image  contrast, 
the  choice  of  whether  to  overlay  the  image  on  a  terrain  elevation  map,  and  the  choice  of  which  image  file 
to  load  all  must  be  chosen  when  VIA  is  invoiced  and  these  choices  cannot  be  changed  without  quitting  «««* 
then  restarting  the  application.  These  options  should  be  changeable  during  program  execution. 

In  VLA,  the  overlay  of  images  on  terrain  currently  does  not  mate  use  of  color.  If  VIA  contained  an 
option  to  display  either  the  terrain  elevation  or  the  image  using  color,  some  users  might  find  it  ranter  to 
determine  elevation  or  to  see  new  features  in  the  data  overlaid  on  the  terrain.  It  is  quite  difficult  to  see  most 
terrain  features  in  the  current  implementation  of  VIA.  It  might  also  be  useful  if,  whenever  the  image  was 
being  overlaid  on  the  terrain,  the  user  could  toggle  display  of  the  image  on  and  off.  If  the  user  could  see 
the  terrain  and  the  image  separately  and  also  be  able  to  switch  rapidly  between  one  image  and  the  other,  the 
user  might  be  able  to  better  visualize  the  image  in  the  context  of  the  terrain. 

It  would  be  beneficial  if  visualization  functionality  was  built  into  the  IE2000  toolkit.  This  could  be 
accomplished  by  adding  support  within  VIA  or  other  visualization  tools  to  allow  the  easy  input  of  many 
image  types  into  the  visualization  tool.  Currently,  VIA  can  only  accept  images  that  are  in  MIVS  format, 
which  limits  its  applicability.  Visualization  tools  will  be  much  more  powerful  and  useful  if  they  are  able  to 
share  images  with  other  tools  in  the  1E2000  environment.  The  IP  Toolkit  image  processing  utilities  are  one 
example  ot  functionality  that  are  not  accessible  by  VIA  images  at  present. 

Currently,  when  images  in  VIA  are  zoomed  by  a  very  large  factor,  aliasing  artifacts  begin  to  appear. 
If  anti-aliasing  filters  were  included  in  VIA,  these  artifacts  could  be  minimized.  In  order  to  satisfy  any 
concerns  about  the  filtering  altering  the  fundamental  image,  it  would  probably  be  worthwhile  to  show  both 
the  non-anti-aliased  image  to  the  analyst  as  well  as  the  anti-aliased  image  to  the  analyst  for  comparison 
purposes. 

VIA  could  also  be  improved  by  the  following  enhancements.  ( 1 )  More  complete  lighting  models  could 
be  used,  including  the  modeling  of  clouds  and  other  atmospheric  effects.  (2)  The  user  might  be  given  the 
capability  to  add  in  their  own  surface  material  features.  (3)  A  capability  could  be  added  for  the  looped 
playback  ot  a  senes  ot  user-selected  image  views,  such  as  a  sequence  of  views  in  a  tiy-ihrough.  A  capability 
to  pause  or  play  a  portion  or  all  of  the  views  backwards  could  also  be  added.  (4)  It  may  also  be  useful  to 
provide  3D  stereo  views  of  imagery  to  increase  realism  and  user  understanding.  (5)  VIA  needs  to  be  able 
to  render  its  images  faster. 
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4  A  Framework  for  a  Graphical  User  interface  for  VIA 

In  this  section,  our  framework  for  an  improved  GUI  for  VIA  is  presented.  Our  framework  is  also  generically 
applicable  to  other  visualization  tools  for  image  exploitation.  We  also  present  a  summary  of  the  current 
status  of  our  GUI  development  work. 

We  have  begun  preliminary  development  of  an  extension  to  the  VIA  GUI.  The  extended  user  interface  is 
being  developed  using  the  Tcl/Tk  system,  a  free,  easily  obtained,  and  widely  available  library  that  operates 
over  X  Windows  developed  by  John  Ousterhout  (10],  Tel  is  an  acronym  for  tool  command  language.  It 
is  a  scripting  language  for  controlling  and  extending  applications.  Tk  is  a  toolkit  for  X  windows  which 
extends  the  core  Tel  with  commands  for  building  user  interfaces.  Tcl/Tk  is  an  open  software  solution, 
making  the  resultant  GUI  highly  portable  and  extendable.  Tcl/Tk  allows  fairly  rapid  prototyping  of  GUIs, 
although  there  is  somewhat  of  a  learning  curve  associated  with  mastering  the  system.  One  of  the  difficult 
concepts  in  Tcl/Tk  involve  local  variables  being  inactivated  whenever  user  button-press,  mouse-click,  or 
keystroke  input  are  processed.  Another  hurdle  to  mastering  Tcl/Tk  is  its  approach  to  evaluating  variables 
and  expressions,  which  often  seems  unusual  to  higher-level  language  programers  and  which  may  seem 
almost  counter-intuitive  for  some  programmers  steeped  in  languages  like  C  and  Fortran. 

One  of  the  difficult  parts  of  graphical  user  interface  design  is  that  interfaces  are  best  constructed  using 
an  iterative  design  philosophy  [131.  This  involves  a  series  of  design  steps,  followed  by  user  tests,  followed 
by  re-design  and  re-test.  Although  iterative  design  may  cause  lengthier  processes  to  create  a  final  product 
it  also  causes  the  user  to  be  a  more  intimate  part  of  the  development  process  and  tends  to  produce  product 
that  are  very  well-suited  to  the  end  users.  We  have  tried  to  seek  user  feedback  whenever  possible  from  VL4 
users  in  the  IE2000  work  area  and  from  those  involved  in  image  analyst  activities.  A  commercial-grade 
end  product  would  need  to  involve  the  user  even  more  mumatelv  than  we  were  able  to  do  so  and  woulc 
need  to  use  more  rigorously  designed  tests  of  user  acceptability. 

Our  accomplishments  in  developing  an  improved  user  interface  in  the  limited  amount  ot  time  available 
are  described  in  this  section.  None  of  our  additions  were  done  on  the  original  program  files,  rather  wt 
made  changes  to  a  copy  of  the  files  named  with  a  ‘-new”  suffix.  A  number  of  data  and  source  files  for  thi 
original  VIA  product  were  found  to  be  duplicated  in  multiple  directories,  and  we  did  reorganize  these  filet 
to  ease  the  maintenance  task  and  to  save  on  storage  space.  We  accomplished  the  reorganization  by  keepinj 
one  master  copy  of  each  duplicated  file  and  creating  links  from  where  each  other  rile  instance  had  been. 

One  of  our  changes  was  to  re-name  certain  button  names  to  reflect  more  standard  GUI  term  usage 
The  name  changes  also  better  reflect  the  behavior  of  the  buttons.  In  the  pop-up  windows  that  controllec 
perspective  viewing  and  illumination  parameters,  the  "GO”  button  was  renamed  to  "APPLY”  and  th< 
"QUIT’  button  was  renamed  to  "DISMISS."  These  changes  were  made  in  the  Motif  code  that  drives  VI/ 
rather  than  in  the  Tcl/Tk  front-end  to  VIA  where  most  of  the  rest  of  our  changes  occurred. 
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Figure  1:  Top  Menu 


The  Tcl/Tk  front-end  we  built  is  a  Motif-style  screen  and  is  strongly  Motif-compliant  This  shouli 
enable  users,  particularly  new  users,  to  master  VIA  quicker.  Modem  visualization  and  GUI  desig 
practices  both  recommend  that  the  user  interface  be  consistent,  easy  to  use,  and  have  a  clean  organization! 
structure  (4.  13].  In  our  Tcl/Tk  front-end,  we  re-organized  the  entire  VIA  GUI  layout,  making  File  an 
View  be  the  only  two  top-level  menu  options.  These  terms  are  customary  and  usual  in  the  X-Window 
application  world.  Our  top-level  screen  is  shown  in  Figure  1 .  Moreover,  our  goal  was  to  develop  a  shor 
well-defined,  logical  structure  to  aid  in  decision-making  and  in  the  easy  location  of  commands.  These  goal 
are  strongly  consistent  with  the  choice  of  a  menu-style  of  interface  a  good  choice  [4],  In  Motif  application 
which  follow  the  Motif  style  guide,  these  options  usually  are  selected  from  a  menubar  at  the  top  of  th 
screen.  Tear-off  menus  may  exist  underneath  these  or  any  other  top-level  options  although  tear-offs  at  tb 
topmost  level  are  not  generally  accepted  in  the  Motif  application  community.  The  tear-off  capability  i 
demonstrated  in  Figure  5.  which  shows  the  opened  View  window. 

Underneath  the  File  option,  we  added  standard  opuons  for  loading  files,  saving  files,  and  quitting.  Thes 
extensions  also  give  VIA  the  ability  to  save  to  any  file  that  the  user  chooses  and  to  change  to  a  new  loai 
dataset  without  exiting  the  program  (through  a  pop-up  window  opened  with  the  Load  selection  of  the  Fill 
menu).  The  pull-down  File  menu  is  shown  in  Figure  2.  Our  re-organization  of  the  layout  also  group 
logically  and  behavioraliy  similar  items  closely  together  on  the  screens.  Logically  and/or  behaviorall; 
distinct  menus  or  objects  are  spatially  separated  on  the  screen.  We  also  supported  keyboard  selection  o 
menu  choices.  Accelerator  keys  could  be  easily  added  to  our  Tk  front-end.  Within  the  File  menu,  we  hav 
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Figure  2:  File  Pull-down  Menu 


also  built  in  the  capability  for  the  user  to  enter  a  save  filename  through  the  Save  option.  This  is  implemented 
using  a  pop-up  window,  shown  in  Figure  3..  .  • 

In  the  pop-up  Load  window,  we  have  added  a  capability  for  the  user  to  enter  an  image  filename  with  a 
browser  (currently,  input  image  names  are  hard-wired  into  VIA  in  the  start-up  script  and  there  is  no  way  to 
change  files  in  the  middle  of  a  session).  We  also  allow  the  user  to  choose  from  menu  if  VIA  will  use  terrain 
overlay,  elevation  scaling,  and  feature  overlay.  In  addition,  the  user  can  choose  which  contrast  option  to 
use.  These  features  add  much  flexibility  and  makes  for  a  clean,  powerful  interface  with  user.  The  pop-up 
Load  window  is  shown  in  Figure  4. 

Additionally,  we  allow  the  user  to  cancel  their  selections  for  the  options  in  the  pop-up  menus.  Any 
settings  changed  after  screen  invocation  but  prior  to  the  cancel  will  be  rolled  back  to  their  settings  before 
screen  invocation. 

A  number  of  items  would  have  been  tackled  if  we  had  more  time.  These  include: 

•  Make  the  file  browser  click-sensitive  for  loading  subdirectories. 

•  Actually  use  the  filename  that  the  user  entered  in  the  Load  and  Save  menus  (right  now.  these  are  only 
demonstration  features  due  to  the  shortness  of  time  for  this  project). 

•  Make  the  Tcl/Tk  quit  button  destroy  the  (currently  separate)  X/Motif-  driven  VIA  process. 

•  Remove  the  LOD  button  from  the  pop-up  X/Motif  menu.  This  button  does  nothing  in  the  Grumman- 
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Figure  3:  Save  menu 


supplied  VIA  program. 

•  Add  a  capability  to  click  on  locations  in  the  image  and  find  the  corresponding  position  in  geographic 
coordinates. 

•  Remove  the  viewing  and  illumination  buttons  from  the  VIA  Motif  window,  replacing  them  with  the 
menu  options  in  the  View  menu  (shown  in  Figure  5). 

•  Have  the  new  VIA  application  write  its  main  image  output  onto  the  Tcl/Tk  window  rather  than  opening 
a  separate  window.  (This  could  be  achieved  by  allowing  the  Tcl/Tk  front-end  to  communicate  with 
the  X/Motif  application.  This  is  possible,  although  it  involves  mastering  some  deeper  technical 
details  of  Tcl/Tk  that  we  did  not  have  time  to  resolve.)  The  main  window  of  Grumman ’s  VIA  tool  is 
shown  in  Figure  6.  Figure  7  shows  the  pop-up  window  for  selecting  VIEW  parameters. 

5  Future  Uses  of  Visualization  in  Image  Exploitation 

Visualization  techniques  have  a  strong  potential  to  benefit  image  exploitation  applications.  Some  of  the 
visualization  features  which  could  be  added  to  future  visualization  tools  for  image  exploitation  are  discussed 
in  this  section.  Some  of  the  techniques  VIA  tool  provide  a  starting  point  from  which  several  applications 
could  be  spun  off,  for  both  image  analysis  and  training  applications.  Three  potential  training  applications 
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Figure  4:  Opened  Load 
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Figure  5:  View  menu 
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Figure  6:  Image  associated  with  selected  view 
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for  visualization  include  training  of  image  analysts,  mission  rehearsal,  and  training  tor  battle  damage 
assessment.  Actual  battle  damage  assessment  activities  would  probably  also  benefit  from  visualization 
capabilities. 

Interactive,  exploratory  visualization  tools  may  prove  useful  in  image  analysis  to  provide  a  capability  to 
more  fully  explore  image  data.  One  example  of  exploratory  visualization  capability  in  the  VIA  tool  is  the 
overlay  of  DTED  and  DEAD  features.  These  help  provide  scene  context,  easing  the  task  of  locating  some 
features.  For  example,  some  roads  were  often  not  readily  apparent  in  the  image  yet  when  DFAD  feature 
overlays  were  added,  analysts  reported  that  it  was  easier  to  locate  road  features.  However,  the  current  VIA 
tool  only  allows  feature  overlays  on  its  most  gross  display  (that  is,  it’s  lowest  resolution,  most  “global? 
view);  the  zoomed-in  views  and  the  perspective  views  do  not  allow  any  feature  overlays.  If  feature  o verity* 
were  supported  for  zoomed-in  views,  it  would  probably  be  necessary  to  address  the  mismatch  of  feature 
fiat  a  resolution  to  terrain  and  image  resolution.  Feature  data  is  generally  stored  at  much  lower  resolution 
than  other  image  data.  A  number  of  techniques  have  been  presented  in  the  literature  that  could  address  the 
poor  registration  of  feature  to  image  data,  including  the  image  morphing  and  warping  of  computer  graphics 
as  well  as  the  physical-based  deformation  models  from  computer  vision.  The  mismatch  between  features 
and  image  data  would  look  quite  disconcerting  in  the  highest  resolution  (maximally  zoomed)  images  sad 
is  already  apparent  in  some  regions  of  VIA’s  “global”  image  display  window.  An  alternate  approach  t£ 
the  registration  problem  would  be  to  give  the  user  the  capability  to  move  or  adjust  the  overlays  in  ar 
interactive  fashion.  This  capability  would  have  to  allow  for  both  global  and  local  adjustments  because  tht 
mis-registration  is  not  merely  a  simple  global  rotation  and  translation;  the  mis-registration  is  different  it 
different  parts  of  the  image.  Additional  capabilities  that  aid  in  image  exploration  should  be  part  of  future 
visualization  efforts  for  image  exploitation  applications. 

Visualization  tools  could  also  be  used  to  aid  image  analysis  by  providing  techniques  that  assist  in  targe 
identification.  For  example,  the  shadow  cast  by  an  object  can  be  used  to  derive  3D  object  shape.  Automate 
algorithms  that  reconstruct  potential  object  profiles  from  shadows  in  the  image  could  assist  image  analyst 
in  determining  object  identity.  This  could  be  implemented  by  allowing  the  analyst  user  to  interactive!' 
segment  the  rough  boundary  of  a  shadow  from  an  image  and  then  use  a  semi-automatic  machine  algorithr 
to  complete  segmentation  and  to  begin  object  shape  detection  and  3D  synthesis  ot  potential  objects.  1 
might  be  possible  to  recognize  the  object  by  machine  as  well,  although  automatic  object  recognition  i 
more  difficult  and  perhaps  less  reliable  than  an  interactive  process  that  keeps  the  human  analyst  in  the  loof 
BRL-CAD  or  other  CAD  models.  DTED.  and  DFAD  could  be  used  to  aid  3D  shape  detection  and  then  use 
to  synthesize  the  object  in  the  image. 

For  future  visualization  tools  in  the  image  exploitation  arena,  capabilities  to  allow  interactive  determi 
nation  of  position  and  image  statistical  features  might  also  be  beneficial.  For  example,  it  the  user  coul 
click  on  an  area  of  the  image  with  the  mouse  and  be  able  to  see  a  side  window  pop  up  with  coordinat 
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information  (in  both  geographic  and  UTM  coordinates),  the  ability  to  navigate  through  the  image  and  to 
determine  precise  positioning  within  the  image  would  be  aided.  Furthermore,  if  a  similar  point-and-clidc 
capability  was  enabled  for  DFAD  feature  data,  the  user’s  ability  to  gather  useful  information  from  the  image 
would  also  be  aided.  The  functionality  termed  geographic  brushing  in  the  literature  [7]  might  also  be  useful 
in  future  products.  Geographic  brushing  means  the  display  of  image  region  features  (such  as  local  region 
histograms),  often  statistical  in  nature,  for  moveable,  selectable  regions  on  the  map  or  image.  Scatter  plot 
outputs  might  also  be  supported  to  display  the  nature  of  the  local  image  features. 

It  should  be  possible  to  correct  images  for  shadows  caused  by  sun  position4.  Through  normalization  of 
shadowed  images,  two  sun-normalized  images  of  a  pre-stored  3D  target  model  (probably  generated  from  a 
CAD  model)  and  an  image  could  be  presented  for  comparison  to  the  analyst  This  may  assist  the  analyst  ia- 
the  identification  of  image  objects. 

Image  analysts  may  also  benefit  if  future  software  tools  allowed  the  loading  and  display  of  images  of 
targets  in  other  images  from  prior  scenarios.  This  would  give  the  analyst  an  encyclopedic  ability  to  compare 
the  current  image  with  previous  images  and  subimages  containing  similar  suspected  targets/threats.  Patton 
analysis  or  template-matching  techniques  would  be  useful  to  screen  archived  image  data  and  in  the  selection 
of  images  for  viewing. 

It  would  also  probably  be  useful  to  attempt  to  display  data  that  can’t  normally  be  visually  seen,  such  as 
the  visibility  window  of  a  ground-based  radar  site  or  radar  signatures  from  different  orientations.  Other 
non-visual  data,  such  as  radioactivity  and  radio  activity  also  might  be  useful.  These  could  be  displayed  as 
colored  translucent  "domes'’  of  activity  overlaid  on  the  image  plane  or  by  other  useful  visual  means.  These 
displays  might  be  useful  both  for  image  analysis  and  for  mission  rehearsal. 

Visualization  tools  in  the  image  exploitation  arena  need  to  be  able  to  support  data  from  multiple  sources 
and  to  include  capability  to  overlay  those  data  types  on  top  of  each  other.  As  has  been  mentioned  earlier, 
the  issues  in  registration  of  data  are  not  trivial,  indeed  a  large  number  of  researchers  are  actively  working 
in  this  area.  In  image  exploitation,  the  capability  to  overlay  multiple  sensor  data,  including  photographic 
data,  infrared  data,  elevation  data,  and  radar  data  (such  as  synthetic  apperture  radar)  are  necessary. 

5.1  Detection  of  Change 

Another  capability  that  would  be  useful  in  the  image  exploitation  environment,  for  both  battle  damage 
assessment  and  for  detection  of  clandestine  activity,  is  that  of  displaying  enhanced  differences  in  images 
collected  over  time.  One  way  that  this  capability  might  be  provided  is  by  dickering  between  "old”  images 
and  “new"  images,  perhaps  allowing  an  analyst  to  detect  clandestine  activity.  Research  in  human  perception 
and  cognition  seems  to  indicate  that  humans  have  an  enhanced  ability  to  detect  image  differences  through 

'‘Both  tune  of  day  and  time  of  year  of  image  capture  will  affect  shadowing. 
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this  method  [11],  It  is  not  clear  if  VIA  or  any  other  visualization  tools  have  actually  been  applied  to  any 
extent  in  this  area.  Other  techniques  for  displaying  change  in  images  collected  over  time  might  involve  the 
use  of  statistical  pattern  recognition  to  detect  image  regions  that  differ  and  then  using  color,  height  maps, 
or  overlaid  (perhaps  translucent)  symbols  to  highlight  the  changed  regions.  Another  possibility  to  detect 
subtle  change  over  time  might  involve  the  use  of  local  region  histograms. 

Visualization  of  future  battle  scenarios  might  also  be  useful  for  battle  damage  assessment  and  for 
war-gaming  purposes.  If  possible  damages  from  munitions  could  be  simulated,  battle  planning  might  be 
aided  as  well  as  battle  damage  assessors  trained  prior  to  actual  battle  situations.  Thus,  visualization  could 
be  useful  in  a  training  role  for  battle  damage  assessment 

It  may  be  useful  to  couple  visualization  functions  with  the  S-MODALS  to  be  able  to  recognize  in»§e 
regions  that  have  suffered  battle  damage.  S-MODALS  may  also  be  useful  in  recognizing  targets  in  an. 
image  or  to  recognize  regions  of  change  in  images  over  time,  such  as  helping  determine  if  SAM  sites  have 
been  built  or  camouflaged  tanks  moved  into  a  region. 

5.2  Fly-throughs 

The  realistic  rendering  of  landscape  elements  as  overlays  on  images  or  maps  may  also  be  useful  in  image 
exploitation.  For  example,  pilots  often  use  waterways  for  navigation,  thus  it  may  be  useful  to  render 
waterways  in  a  photo-realistic  manner,  especially  for  mission  rehearsal  applications  A  capability  that 
provided  actual  texture  mapping  of  an  image  product  onto  a  terrain  elevation  model  would  probably  also  be 
a  useful  adjunct  to  the  photo-realistic  display  of  waterways.  Other  landscape  elements  such  as  trees,  torests, 
cities,  etc.,  could  be  rendered  using  fractal  techniques.  If  Vertical  Right  Obstruction  Data  was  available, 
power  lines,  towers,  chimneys,  etc.,  could  also  be  synthesized  for  mission  rehearsal  tlv-throughs.  Display 
of  minimum  flight  elevauon  (MFE)  data  as  a  translucent  "cloud”  over  the  image  might  too  be  useful  for 
this  applications.  The  interpolation  of  terrain  elevauon  might  also  add  to  the  ieeling  ot  photo-realism.  For 
real-world  use.  scenes  would  have  to  be  constructed  quickly  from  the  3D  terrain  model.  There  appears  tc 
be  a  significant  level  of  interest  from  many  commanders  to  train  for  missions  using  computer-generated 
views  and  fly-throughs.  Contour  mapping  of  terrain  elevation  data  may  also  be  useful  as  a  display  option 
for  some  applications. 

For  fly-throughs  or  drive-throughs  ti.e..  navigauon-style  applications),  it  would  be  useful  to  suppo* 
the  display  of  multiple  scene  views  simultaneously  on  the  screen  while  also  taking  care  not  to  clutter  the 
screen.  This  may  assist  the  user  in  determining  their  current  position  and  also  help  maintain  a  global  view 
of  the  data  and  a  memory  of  what  the  user  has  previously  visited.  Techniques  similar  to  this  have  beer 
recommended  by  some  in  the  cartographic  visualization  community  [6].  One  approach  might  be  to  let  the 
user  save  a  series  of  viewpoints  by  clicking  on  the  image  display  and  then  display  the  scenes  from  thos< 
viewpoints,  maintaining  the  same  projection  geometry  for  ail  views.  This  would  roughly  simulate  the  usei 
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moving  from  point  to  point  along  flight  paths.  It  may  also  be  useful  to  interpolate  views  between  points.  I 
a  few  rendering  short-cuts  can  be  used  in  “navigating”  through  the  different  scene  viewpoint,  near-real tim 
navigation  will  become  more  likely.  Rendering  “shortcuts”  that  involve  image  interpolation  may  not  b 
suitable  for  image  analysis  applications,  though,  due  to  the  possiblity  of  missing  some  scene  features.  Fo 
effective  navigation,  a  multiple  resolution  approach  to  data  representation  is  likely  also  required.  This  wii 
allow  the  user  to  achieve  both  global  navigation  and  local  navigation  [3].  Further,  use  of  text-based  aid 
and  navigation  buttons  seems  to  help  avoid  information  overload  f  1], 

One  concern  about  the  viability  of  fly-throughs  or  drive-throughs  might  be  the  speed  at  which 
can  be  rendered.  One  typical  approach  to  rendering  landscape  images  is  to  map  images  or 
a  polygonal  approximation  of  the  terrain  elevations.  Often,  this  can  produce  near-realtime 
However,  one  open  issue  in  the  overlay  of  photos  or  other  texture  maps  is  how  to  quickly  overlay  tnultipl 
texture  maps  onto  the  terrain  polygons  in  near-realtime.  To  achieve  near-realtime  operation,  it  may  b 
necessary  to  port  a  fly-through  to  a  parallel  machine.  If  there  are  few  3D  objects  to  render  and  little  31 
maneuvering,  however,  a  fast  serial  machine  may  be  adequate. 

5  J  Image  Analyst  Training 

Perhaps  one  of  the  most  useful  short-term  applications  for  visualization  for  image  analysis  is  in  the  trainin; 
of  image  analysts.  By  providing  access  to  powerful  visualization  toois  that  allow  analysts  to  see  a  larg 
collection  of  existing  images  of  various  targets  and  by  also  giving  analysts  the  capability  to  see  synthesize! 
images  of  targets  and  threats  in  new  scenarios,  it  should  be  possible  to  provide  a  large  and  useful  set  o 
training  imagery  for  the  beginning  analyst.  Analysts  could  thus  be  exposed  to  many  more  situations  that 
are  actually  readily  available  in  real  imagery.  Visualization  can  play  an  important  role  in  image  analys 
training. 

5.4  Finale 

No  report  on  visualization  would  be  complete  without  mentioning  that  before  final  adaptation  of  . 
visualization  product  in  a  non-experimental  setting,  it  would  be  necessary  to  run  several  experiments  witl 
many  users  and  images  and  to  then  verify  that  visualization  techniques  indeed  do  improve  productivity 
Carefully  constructed  visualization  tools  should  be  able  to  meet  the  challenge  of  improving  productivity  b] 
helping  end  users  discover  trends  in  data  that  were  not  apparent  before. 

The  focus  for  future  military  applications  will  be  the  fast  integration  of  multiple  sources  of  data.  It  wil 
be  necessary  to  put  data  together  quickly  and  effectively  from  multiple  sources.  Once  the  data  is  integrated 
it  will  be  necessary  to  quickly  pian  using  the  data.  It  may  also  be  necessary  to  quickly  train  using  th< 
data,  and  then  to  respond  quickly  to  emerging  threats.  Visualization  can  assist  in  providing  the  requisite 
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interactivity,  connectivity,  and  integration  required  in  future  military  enterprises.  Visualization  has  the 
potential  of  assisting  both  the  operators  in  the  field  and  the  planners  at  headquarters. 


Addendum:  Extension  of  S-MODALS  ANN 

The  S-MODALS  technology  has  been  used  for  target  recognition,  for  medical  diagnostic  applications, 
and  for  justice  applications.  This  technology  has  been  found  to  be  especially  useful  in  the  detection  and 
classification  of  pulmonary  tumors. 

One  weakness  of  the  ANN  software  is  that  it  lacks  a  graphical  user  interface  (GUI).  Recently,  a  TIM40 
board  with  three  C40  processors  has  become  available  to  Rome  Laboratory,  offering  the  opportunity  to 
explore  parallel  implementations  of  the  artificial  neural  net  technology. 

For  our  summer  work,  we  originally  proposed  to  investigate  parallel  realizations  of  the  artificial  neural 
network  on  the  TIM40  hardware.  Furthermore,  we  proposed  to  develop  a  graphical  user  interface  for  the 
ANN  software.  The  addition  of  a  GUI  to  the  S-MODALS  software  would  have  made  it  a  very  useftil  part 
of  the  IE  2000  environment 

A.1  Parallel  Realizations  of  the  ANN  S-MODALS  System 

Our  first  milestone  was  to  have  been  a  port  of  the  neural  network  algorithm  to  the  TIM40  environment. 
We  were  next  planning  to  investigate  the  effects  of  task  granularity  and  memory  contention  on  the  parallel 
algorithm  performance.  Our  goal  was  to  optimize  the  performance  of  the  algorithm  on  the  TIM40.  In 
addition,  we  were  to  attempt  to  improve  the  al  gorithm  so  that  its  performance  was  more  robust.  An  increased 
usage  of  knowledge  about  the  problem  environment  might  have  delivered  this  performance  improvement 

Complications  in  ANN  Research 

Some  of  the  difficulties  we  encountered  in  our  efforts  included: 

•  A  contractor  controlled  the  source  code  and  would  not  release  it  to  us.  We  were  unaware  of  this  when 
the  project  began. 

•  The  contractor  would  not  provide  timely  assistance  in  resolving  configuration  problems  in  porting 
the  software  executable  to  Rome  Lab. 

•  We  eventually  discovered  that  the  software  would  not  run  on  the  hardware  configuration  at  Rome 
Lab.  The  contractor  would  not  supply  the  changes  necessary  to  allow  the  software  to  run.  Due  to  the 
contractor  controlling  the  source  code,  we  were  unable  to  make  the  necessary  software  configuration 
changes  that  would  have  allowed  the  software  to  run  on  our  hardware. 
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We  did  not  have  a  cross-compiler  to  allow  us  to  generate  source  code  of  our  own  that  exercised  tl 
TIM-40  boards  at  Rome  Lab. 


•  We  switched  from  the  ANN  project  in  the  seventh  week  of  ten  at  Rome  Lab.  We  had  spent  a  great 
amount  of  time  and  effort  to  discover  and  resolve  the  configuration  problems  and  really  had  only  two 
weeks  to  focus  on  our  final  project  in  visualization. 
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CALIBRATION  OF  INFRARED  THERMOGRAMS  OF  ELECTROMAGNETIC  FIELDS 

John  D.  Norgard 
Professor/EC  E 

Department  of  Electrical  &  Computer  Engineering 
University  of  Colorado 

Abstract 

The  infrared  (IR)  imaging  technique  for  measuring  electromagnetic  (EM)  fields  was  further 
developed  this  summer  to  map  two-dimensional  EM  field  distributions  near  a  radiator  or  a  scattering 
body.  Both  electric  and  magnetic  field  intensities  have  been  measured  with  this  technique.  Initial  tests 
to  prove  the  validity,  accuracy  and  sensitivity  of  this  technique  were  performed  this  summer  in  the 
anechoic  chamber  at  the  Electromagnetic  Vulnerability  Analysis  Facility  (EMVAF)  at  Rome  Laboratory 
(RL).  Empirical  calibration  tests  were  performed  to  determine  the  absolute  intensity  level  of  the 
magnitude  of  the  electric  field.  Magnetic  field  intensity  levels  were  not  calibrated  this  summer. 

Experimental  tests  were  performed  on  a  conducting  cylinder  irradiated  by  an  incident  plane 
wave.  E-  and  H-  field  patterns  of  the  scattered/diffracted  energy  from  the  cylinder  where  measured. 
Tests  were  conducted  at  various  microwave  frequencies  relative  to  the  resonant  frequencies  associated 
with  the  cylinder  and  at  numerous  angles  of  incident  (eg.  end-on,  broad-side,  oblique-incidence)  and 
for  several  different  polarizations  of  the  incident  field  relative  to  the  axis  of  the  cylinder  (eg.  horizontal, 
vertical,  and  skewed).  Each  tests  was  performed  in  the  near  and  far  fields  of  the  cylinder.  IR 
thermograms  of  the  scattered  fields  were  taken.  The  equi-temperature  contour  levels  in  the  IR 
thermograms  are  being  compared  to  numerical  predictions  of  the  scattered  electric  field  intensity.  The 
numerical  code  was  developed  at  the  Jet  Propulsion  Laboratory  (JPL).  These  experimental  results  will 
be  used  to  validate  the  accuracy  of  the  JPL  numerical  code. 

The  imaging  technique  was  also  used  to  map  the  scattered  electric  field  around  a  model  of  an 
aircraft.  A  plastic  FI  6  scale  model  was  constructed  and  sprayed  with  several  coats  of  silver  paint  to 
made  it  conductive.  IR  thermograms  of  the  magnitude  of  the  scattered  electric  field  intensity  were 
taken  in  the  horizontal  and  vertical  longitudinal  planes  through  the  fuselage  of  the  aircraft.  The  equi- 
temperature  contour  levels  in  the  IR  thermograms  are  being  compared  to  numerical  predictions  of  the 
scattered  electric  field  intensity.  The  numerical  predictions  are  being  performed  with  the  GEMACS 
code  at  Rome  Laboratory.  These  experimental  results  will  be  used  to  validate  the  accuracy  of  the 
GEMACS  numerical  code. 

The  lossy  IR  detector  screen  was  irradiated  by  a  normally  incident  EM  wave  in  the  near  and  far 
fields  of  the  antenna.  The  level  of  the  incident  electric  field  was  measured  using  D-dot  probes  and  was 
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also  predicted  theoretically  using  the  Friis  Transmission/Reception  Formula.  The  known  field  radiated 
from  the  standard  gain  horn  antenna  was  used  to  calibrate  the  measured  temperature  levels  in  the  IR 
detector  screen.  A  correlation  of  the  measured  temperature  level  (a  selected  color  on  the  IR 
thermogram)  to  the  measured  or  predicted  magnitude  of  the  incident  electric  field  was  used  to  calibrate 
the  IR  technique.  A  table  of  color  level  vs.  incident  field  level  was  made.  This  table  is  the  calibration 
curve  for  the  IR  detector  screen  used  in  the  calibration  test.  Different  IR  detector  screens  with 
different  sensitivities  can  all  be  calibrated  using  this  same  technique. 

Four  papers  were  presented  at  international  IR  and  EM  conferences  this  summer;  two  seminars 
were  also  presented  on  the  technique.  One  paper  was  published  in  an  IR  journal;  three  new  papers 
have  been  submitted  for  presentation  at  several  IR  conferences  next  year. 
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CALIBRATION  OF  INFRARED  THERMOGRAMS  OF  ELECTROMAGNETIC  FIELDS 


John  D.  Norgard 

An  infrared  OR)  measurement  technique  is  being  developed  to  measure  electromagnetic  (EM) 
fields.  This  technique  uses  a  minimally  perturbing,  thin,  planar  IR  detection  screen  to  produce  a 
thermal  map  of  the  intensity  of  the  EM  energy  over  a  two-dimensional  region.  EM  fields  near  radiating 
microwave  sources  and  scattering  bodies  can  be  measured  with  this  technique.  This  technique  also 
can  be  used  to  correlate  theoretical  data  with  experimental  observations  and  to  experimentally  validate 
complicated  numerical  codes  which  predict  electric  field  distributions  inside  waveguide  cavities  <E- 
Fields)  and  surface  current  distributions  on  metallic  surfaces  (H-Fields). 

1 .  Introduction 

A  non-destructive,  minimally  perturbing  IR  measurement  technique  is  being  developed  to 
observe  EM  fields.  Metallic  surface  currents  and  charges  also  can  be  measured  with  this  technique. 

This  IR  measurement  technique  produces  a  two-dimensional  IR  thermogram  of  the  electric  or 
magnetic  field  being  measured,  i.e.  a  two-dimensional  isothermal  contour  map  or  a  gray  scale  of  the 
intensity  of  the  EM  field. 

Electric  and  magnetic  fields  can  be  measured  separately.  For  example,  electric  field  patterns 
radiated  from  microwave  horn  antennas,  electric  field  intensities  coupled  through  apertures  in  shielded 
enclosures,  diffraction  patterns  of  electric  fields  scattered  from  complicated  metallic  objects,  and 
electric  field  modal  distributions  induced  inside  cylindrical  waveguide  cavities  can  be  measured.  Also, 
magnetic  field  distributions  near  conductive  surfaces  and  induced  surface  currents  on  metallic  surfaces 
can  be  determined. 

The  advantages  and  disadvantages  of  this  new  IR  measurement  technique  are  discussed  later. 

2.  IR  Measurement  Technique  (Overview) 

The  IR  measurement  technique  is  based  on  the  Joule  heating  that  occurs  in  a  lossy  material 
as  an  EM  wave  passes  through  the  material.  A  thin,  planar  sheet  of  lossy  carbon  loaded  paper  can  be 
used  to  map  electric  fields;  a  thin,  planar  sheet  of  lossy  ferrite  loaded  epoxy  can  be  used  to  map 
magnetic  fields.  In  either  situation,  the  absorbed  heat  energy  is  converted  into  conducted  and 
convected  heat  energy  and  into  re-radiated  EM  energy.  The  radiated  EM  energy  is  concentrated  in  the 
IR  band.  This  "black  body"  energy  can  be  detected  with  an  IR  (Scanning)  Array  or  with  an  IR  (Starring) 
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Focal  Plane  Array  (FPA). 


2.1 .  IR  Experimental  Setup 

This  technique  involves  placing  a  lossy  non-perturbing  IR  detection  screen  in  the  plane  over 
which  the  EM  fields  are  to  be  measured. 

2.1.1.  Electromagnetic  Parameters 

The  detector  screen  can  be  made  from  a  thin  sheet  of  linear,  homogenous  and  isotropic  but 
lossy  material.  From  the  complex  form  of  Poynting's  Theorem  for  a  linear,  homogeneous  and  isotropic 
material,  the  absorbed  power  Pab,  within  a  given  volume  V  of  the  lossy  material  is  a  function  of  the 
electric  (E)  and  magnetic  (H)  field  intensities  inside  the  screen  and  is  given  by 

Pabs  =  f  (oE2+toe,,Ez  +  (j)\l,/H2)  dV  [w/m2]  (1) 

where  a  is  the  conductivity  of  the  detector  screen,  e~  is  the  imaginary  component  of  the  permittivity 
of  the  detector  screen,  fj~  is  the  imaginary  component  of  the  permeability  of  the  detector  screen,  and 
w  is  the  radian  frequency  of  the  incident  field.  The  volume  integral  is  over  the  illuminated  portion  of 
the  detector  screen.  The  spectral  characteristics  of  the  complex  constitute  parameters  (//,  e,a)  of  the 
detector  material  must  be  known  (or  measured)  over  the  entire  frequency  bandwidth  to  be  measured. 

The  incident  EM  energy  is  absorbed  by  the  lossy  material  and  is  converted  into  thermal  heat 
energy  which  causes  the  temperature  of  the  aetector  material  to  rise  above  the  ambient  temperature 
of  the  surrounding  background  environment  by  an  amount  which  is  proportional  to  the  local  electric 
and/or  magnetic  field  intensity  (energy)  at  each  point  (pixel)  in  the  screen  material.  In  regions  where 
the  fields  are  strong,  the  absorbed  energy  is  large  and  the  resulting  pixel  temperatures  are  high;  in 
regions  where  the  fields  are  weak,  the  absorbed  energy  is  small  and  the  resulting  pixel  temperatures 
are  low.  The  resulting  two-dimensional  temperature  distribution  over  the  surface  of  the  screen  can  be 
detected,  digitized  and  stored  in  the  memory  of  an  IR  camera.  The  temperature  distribution  on  the 
surface  of  the  screen  without  any  EM  energy  incident  on  the  screen  also  can  be  stored  in  the  memory 
of  the  IR  camera  as  a  ambient  background  reference  temperature  distribution.  The  difference  in  the 
temperature  distributions  at  each  pixel  (between  the  illuminated  and  the  non-illuminated  screen)  is  due 
to  the  effects  of  the  electric  or  magnetic  field  incident  on  the  screen  at  that  pixel  location. 

The  EM  effects  can  be  visualized  by  presenting  the  differenced  two-dimensional  temperature 
profile  as  a  false  coior  image,  where  cool  colors  (for  example  shades  of  blue)  represent  weak  areas  of 
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EM  energy  and  hot  colors  (for  example  shades  of  red)  represent  strong  areas  of  EM  energy.  The 
resulting  two-dimensional  false  color  image  is  called  an  IR  thermogram,  ie.  an  iso-temperature  contour 
map,  and  is  a  representation  of  the  electric  and/or  magnetic  field  distribution  passing  through  the 
screen  material. 

2.1.2.  Thermal  Parameters 

For  a  planar  sheet  of  detector  screen  supported  by  a  block  of  non-conducting  material,  eg.  a 
styrofoam  block,  the  thermal  problem  reduces  to  considering  only  the  radiative  and  convective  heat 
losses  from  the  surface  of  the  detector  material. 

The  convective  heat  loss  hc  is  approximated  by 

hc  =  ha  (T-T^)1-25  [ W/m 2]  (2) 

where  hD  varies  between  1 .4  and  1 .6.  The  radiative  heat  loss  h,  is  approximated  by 

hz  =  [W/m2]  (3) 

where  elr  is  the  detector  surface  emissivity,  alr  is  the  Stefan-Boltzman  constant  in  W/m2  -  K4,  and  the 
temperatures  are  in  degrees  Kelvin.  The  conductive  heat  loss  is  negligibly  small. 

2.1.3.  Thermal  Equilibrium 

The  heat  transfer  problem  in  the  detector  material  involves  solving  a  non-linear,  second  order 
differential  equation  in  both  space  and  time,  while  considering  radiative  and  convective  heat  losses 
from  the  surface  of  the  material,  conductive  heat  transfer  within  the  material,  and  the  EM  power 
absorbed  in  the  material  as  a  function  of  distance  into  the  material.  For  the  case  of  the  thin  screens 
considered  here,  the  temperature  is  initially  considered  to  be  constant  in  the  direction  normal  to  the 
surface  of  the  material,  so  that  the  conductive  term  normal  to  the  surface  of  the  screen  can  be  ignored 
and  the  power  absorbed  can  be  considered  independent  of  the  direction  normal  to  the  surface  of  the 
screen.  Also,  the  time  dependence  of  the  absorbed  heat  energy  is  ignored  for  the  steady-state 
solution. 

Relating  the  convective  and  radiative  heat  losses  in  equations  (2)  and  (3)  to  the  absorbed  power 
in  equation  (1),  results  in  the  following  equation  at  thermal  equilibrium: 

p^K+K  (4) 

For  a  properly  optimized  detector  screen,  thermal  equilibrium  is  achieved  in  just  a  few  seconds. 
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This  non-linear  thermal/electrical  equation  can  be  solved  for  the  electric  or  magnetic  field,  as 
a  function  of  the  material  temperature  T,  using  approximate  techniques. 

2.2.  Approximate  Solution 

Equation  (4)  is  highly  non-linear  for  large  temperature  variations  above  ambient,  due  to  the 
thermal  processes  of  convection  and  radiation.  However,  for  small  temperature  variations  of  only  a 
few  degrees  above  ambient,  equation  (4)  can  be  linearized  for  small  incremental  temperature  changes 
AT  =  T  -T,mb  above  the  ambient  temperature  Tamb. 

This  condition  of  small  temperature  variations  above  ambient  is  a  desirable  operational 
constraint,  since  this  is  also  the  requirement  for  small  absorption  of  the  EM  energy  passing  through  the 
screen,  which  equates  to  a  small  perturbation  of  the  incident  field  when  performing  the  measurement. 
For  this  minimally  perturbing  measurement  case,  an  almost  direct  linear  correlation  exists  between  the 
incremental  surface  temperature  AT  and  the  absorbed  electric  or  magnetic  field  intensity.  The  incident 
electric  or  magnetic  field  can  then  be  determined  from  a  solution  of  Maxwell's  Equations  (Fresnel's  and 
Snell's  laws)  for  an  EM  wave  incident  on  a  planar  film  of  lossy  material.  Therefore,  it  is  possible  to 
correlate  local  surface  temperature  variations  AT  to  E  or  H  field  intensities. 

For  the  scanning  camera  available  this  summer  at  Rome  Laboratory  to  make  the  IR 
thermograms,  temperature  differences  AT  as  small  as  0.09  °K  could  be  detected. 

Care  is  exercised,  therefore,  in  the  selection  of  the  screen  material  not  to  significantly  perturb 
the  electric  or  magnetic  field  by  the  presence  of  the  lossy  material.  The  screen  can  be  designed  to 
absorb  from  1  %  to  5%  of  the  incident  power  and  to  produce  a  temperature  change  of  less  than  a  few 
degrees.  The  constitutive  parameters  of  the  IR  detector  screen  can  be  optimized  to  produce  a  large 
temperature  rise  in  the  detector  material  for  a  small  amount  of  absorbed  energy. 

Electric  and  magnetic  fields  produced  by  Continuous  Wave  (CW)  sources  operated  in  the 
sinusoidal  steady-state  mode  are  easy  to  measure  because  of  the  large  amount  of  energy  contained 
in  the  wave.  Transients  produced  by  high  power  microwave  (HPM)  pulsed  sources,  especially 
repetitively  pulsed  sources,  also  can  be  measured,  if  the  average  energy  content  in  the  pulse  is  high 
enough  to  raise  the  temperature  of  the  detector  screen  material  above  the  minimum  temperature 
sensitivity  of  the  IR  camera.  The  thermal  mass  of  the  detector  material  should  hold  the  absorbed  heat 
energy  long  enough  to  capture  the  IR  thermogram  of  the  pulse. 

2.3.  IR  Detector  Screen 

Referring  to  equation  (1 ),  the  screen  material  can  be  tailored  to  respond  to  only  one  component 
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of  the  field,  e.g.  by  optimizing  the  values  of  the  electrical  conductivity  a  and  the  imaginary  part  of  the 
permittivity  e"  of  the  material  relative  to  the  imaginary  part  of  the  permeability  //"  of  the  material,  the 
detector  screen  can  be  made  sensitive  either  to  the  tangential  component  of  the  electric  field  or  to  the 
tangential  component  of  the  magnetic  field  in  the  plane  of  the  screen. 

For  example,  an  electric  field  detector  screen  can  be  constructed  either 
i)  from  a  lossy  material  with  a  high  conductivity  a  and  a  low  imaginary  permittivity  e"  and  a  low 
imaginary  permeability  //" 

or  ii)  from  an  electrically  polarizable  material  with  a  high  imaginary  permittivity  e"  and  a  low 

conductivity  a  and  a  low  imaginary  permeability  //". 

Alternatively,  a  magnetic  field  detector  screen  can  be  constructed  from  a  magnetically 
polarizable  (magnetizable)  material  with  a  high  imaginary  permeability//"  and  a  low  conductivity  a  and 
a  low  imaginary  permittivity  e" . 

The  optimization  of  the  thermal  and  electrical  parameters  of  the  detection  screen  material 
should  be  guided  by  a  thermal/electrical  computer  code  based  on  a  plane  wave  normally  incident  on 
a  planar  interface  between  air  and  the  lossy  detector  material.  Other  absorptive  and  re-emittive 
transducing  materials  have  been  studied  for  use  as  passive  thermal  screens  for  IR  thermograms. 

2.3.1  Electric  Field  Detector  Screen 

For  the  detection  of  electric  fields,  the  IR  detector  screen  can  be  made  from  a  planar  sheet  of 
lossy  thin-film  material.  Several  different  detector  screens  were  used  to  make  electric  field 
thermograms.  One  was  a  carbon  loaded  thin  film  (eg.  Teledeltos  Paper)  80  //m  thick  with  a 
conductivity  of  8  mhos  per  meter.  The  other  screens  were  made  from  carbon-loaded  Kapton  films. 
The  films  were  loaded  with  different  resistivities  per  square.  These  materials  are  non-polarizable  and 
non-conducting;  therefore,  the  imaginary  components  of  the  permittivity  e"  and  the  permeability//"  are 
negligibly  small.  For  these  non-conducting,  non-polarizable,  non-magnetic  screen  materials,  maximum 
heating  occurs  due  to  the  electric  field  and  negligible  heating  occurs  due  to  the  magnetic  field. 

For  plane  waves  normally  incident  on  this  carbon  loaded  electric  field  detection  screen  and  for 
an  IR  FPA  with  a  temperature  sensitivity  of  0.01  °K,  electric  fields  with  a  magnitude  on  the  order  of 
61 .4  V/m  (1  mW/cm2  of  incident  power)  can  be  detected.  This  result  was  obtained  empirically  from 
experimental  data  in  which  the  incident  power  level  was  incrementally  decreased  until  no  electric  field 
contour  lines  on  the  IR  thermogram  were  discernable  from  the  ambient  background  level. 
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2.3.2  Magnetic  Field  Detector  Screen 


For  the  detection  of  magnetic  fields,  an  IR  detection  screen  can  be  fabricated  using  a  ferrite 
loaded  epoxy.  Several  different  ferrite  powers  have  been  tested,  viz.  Nickel  Zinc  Copper  Ferrite,  Iron 
Silicide  Ferrite,  Ferrite  50,  etc.  The  best  detector  screen  used  to  make  magnetic  field  thermograms 
had  an  80%  by  weight  mixture  of  Ferrite  50  with  a  thickness  of  0.5  mm  and  an  imaginary  relative 
permeability  fj"  of  approximately  2.  The  imaginary  component  of  the  permittivity  e"  and  the 
conductivity  a  were  negligibly  small.  For  this  magnetic,  non-conducting,  non-electric  screen  material, 
maximum  heating  occurs  due  to  the  magnetic  field  and  negligible  heating  occurs  due  to  the  electric 
field. 

In  the  fabrication  of  a  magnetic  screen,  it  is  difficult  to  keep  the  electric  polarizability  of  the 
epoxy  base  material  from  contributing  to  the  imaginary  component  of  the  permittivity  e"  of  the 
composite  and  .thereby,  introducing  additional  electric  polarization  losses.  If  the  electric  properties  of 
the  ferrite  are  not  carefully  controlled  and  minimized,  the  absorbed  power  will  be  due  to  both  the 
electric  and  magnetic  components  of  the  EM  field,  as  given  by  equation  (1 ).  In  this  case,  it  is  difficult, 
if  not  impossible,  to  separate  the  two  coupling  mechanisms  from  each  other  and  to  calibrate  the 
technique. 

To  detect  surface  currents  on  a  metallic  structure,  the  magnetic  detection  screen  can  be  placed 
in  direct  contact  with  the  surface  of  the  metal.  The  thickness  of  the  detector  screen  must  be  kept  to 
a  minimum,  so  that  the  magnetic  field  that  is  measured  on  the  outer  surface  of  the  screen  is  a  direct 
reflection  of  the  magnetic  field  on  the  inner  surface  of  the  screen  in  contact  with  the  surface  of  the 
metal.  The  material  can  be  held  flat  against  the  surface  of  the  metal  using  a  non-conducting  glue. 

The  surface  current  is  related  to  the  tangential  component  of  the  magnetic  field  intensity  on 
the  surface  of  the  metal  by  the  magnetic  boundary  condition  on  a  perfect  conductor: 


n  x  3  \t  = 

where  n  is  the  normal  to  the  surface  S.  The  magnetic  field  is  perpendicular  to  the  direction  of  the 
surface  current. 

2.4  IR  Camera 

The  temperature  difference  between  the  screen  material  and  the  background  is  detected, 
digitized,  and  stored  in  the  memory  of  an  IR  camera  on  a  pixel  by  pixel  basis.  The  IR  imaging  system 
used  to  take  the  IR  thermograms  has  approximately  200  by  100  pixels  per  frame  of  data.  The  detector 
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is  scanned  over  the  image.  The  detector  is  a  Mercury-Cadmium-Teluride  (HgCdTe)  IR  photo  detector 
operated  in  a  photo  voltaic  mode.  The  detector  operates  at  liquid  Nitrogen  temperatures.  The  camera 
can  detect  temperature  differences  of  approximately  0.09  °K,  and  has  a  relative  accuracy  of  plus  or 
minus  10%  when  detecting  EM  fields. 

2.5  IR  Images 

The  stored  IR  thermogram  data  represents  the  temperature  distribution  over  the  extent  of  the 
detector  screen  and  is  a  map  of  the  intensity  of  the  electric  or  magnetic  field  distribution  absorbed  in 
the  screen.  For  small  temperature  rises  less  than  a  few  degrees  above  ambient,  the  electric  and 
magnetic  field  intensities  are  nearly  linearly  proportional  to  the  temperature  change. 

2.5.1  Spatial  Resolution 

The  spatial  resolution  of  the  IR  thermogram  is  a  function  of  the  number  of  pixel  elements  in  the 
IR  camera,  and  is  fixed  by  the  angular  resolution  of  the  telephoto,  regular  or  wide  angle  lens  used  on 
the  IR  camera  when  making  the  IR  image.  A  telephoto  lens  can  be  used  to  look  at  small  details  in  the 
field  structure  on  the  detector  screen;  a  wide  angle  lens  can  be  used  to  look  at  large  scale  trends  in 
the  field  structure  in  the  detector  screen. 

The  telephoto  lens  also  has  the  added  advantage  for  regular  field  mapping  applications  of 
allowing  the  IR  camera  to  be  located  far  away  from  the  object  under  test  and,  thus,  removing  any 
perturbing  effects  that  the  metallic  structure  of  the  camera  might  have  on  the  field  distribution  being 
measured. 

2.5.2  Thermal  Resolution 

The  thermal  resolution  of  the  IR  thermogram  is  a  function  of  the  digitizer  in  the  IR  camera.  The 
IR  camera  used  to  take  the  IR  thermograms  has  a  12  bit  digitizer.  For  a  12  bit  digitizer,  the 
temperature  range  seen  by  the  IR  camera  is  divided  into  256  increments.  Each  digitized  increment  is 
assign  a  unique  color,  resulting  in  a  temperature  resolution  of  256  color  levels. 

2.5.3  Thermogram  Errors 

The  resulting  IR  image  of  an  EM  field  depends  on  the  combined  EM  and  thermal  properties  of 
the  detector  material  and  is  subject  to  several  significant,  but  controllable,  errors. 
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2.5.3. 1 


Lateral  Conduction  Effects 


Conductive  heating  in  the  transverse  direction  within  the  screen  material  causes  thermal 
"bleeding"  from  the  hot  spots  on  the  screen  to  nearby  cold  spots.  This  thermal  bleeding  tends  to  fill 
in  the  nulls  (minimums)  somewhat,  whereas,  the  areas  of  maximum  heating  (peaks)  are  not  effected 
very  much  by  this  effect.  This  effect  can  be  minimized  by  operating  at  small  temperature  variations 
above  ambient. 


2.5.3. 2  Lateral  Convective  Effects 

Convective  heating  of  the  top  of  the  screen  from  heat  rising  from  the  bottom  of  the  screen 
causes  the  top  of  the  screen  to  appear  slightly  hotter  than  the  bottom  of  the  screen.  This  "blurring" 
of  the  image  can  be  kept  to  a  minimum  by  operating  at  small  temperature  variations  above  ambient. 

This  blurring  effect  can  be  eliminated  completely  by  placing  the  IR  detector  screen  in  a 
horizontal  position  and  observing  the  image  with  the  IR  camera  looking  down  on  the  screen  from  above 
or  from  the  side  using  an  IR  mirror. 

2.5.4  IR  Measurement  Accuracy 

The  accuracy  of  the  IR  measurement  technique  was  demonstrated  by  performing  two  simple 
experiments  with  known  theoretical  solutions.  In  one  experiment,  the  diffraction  pattern  from  a 
"Lloyd's  Mirror"  was  measured.  In  the  other  experiment,  the  diffraction  pattern  from  an  "knife  edge" 
conducting  half-plane  was  measured. 

In  the  Lloyd's  mirror  experiment,  the  resulting  diffraction  pattern  is  due  to  the  antenna 
interfering  with  its  image  in  a  large  ground  plane.  The  near  field  (Fresnel  Zone)  antenna  pattern  of  the 
horn  antenna  was  used  to  obtain  the  theoretical  results. 

In  both  experiments,  the  screen  material  was  optimized  to  measure  only  the  tangential 
component  of  the  electric  field  intensity  in  the  plane  of  the  screen. 

These  experiments  were  performed  in  an  anechoic  chamber.  Good  correlations  between  theory 
and  experiment  were  obtained.  The  worst  errors  occurred  in  the  minimums  (deep  nulls)  of  the 
diffraction  patterns  were  thermal  bleeding  from  the  surrounding  hot  areas  tended  to  obscure  the  real 
depth  of  the  minimums.  Some  thermal  bleeding  out  of  the  maximums  into  the  surrounding  areas  also 
occurred,  obscuring  the  real  height  of  the  maximums. 

Even  with  conductive  bleeding  and  convective  blurring  of  the  image,  the  measurement  error 
is  less  than  approximately  10%  under  controlled  test  conditions. 
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2.6  IR  Advantages  and  Disadvantages 


The  IR  measurement  technique  provides  a  quick  and  accurate  method  to  observe  EM  fields  in 
a  two-dimensional  plane.  However,  only  the  magnitude  of  the  electric  or  magnetic  field  is  measured; 
no  phase  information  is  detected.  Also,  since  this  technique  is  based  on  the  thermal  mass  of  the 
detector  material,  high  energy  is  required  to  produce  good  thermal  images  of  EM  fields. 

3.  IR  Thermograms 

Examples  of  IR  measurements  taken  this  summer  at  Rome  Laboratory  are  now  presented. 

3.1  Scattering  from  a  Cylinder 

Experimental  tests  were  performed  on  a  conducting  cylinder  irradiated  by  an  incident  plane 
wave.  E-  and  H-  field  patterns  of  the  scattered/diffracted  energy  from  the  cylinder  where  measured. 
Tests  were  conducted  at  various  microwave  frequencies  relative  to  the  resonant  frequencies  associated 
with  the  cylinder  and  at  numerous  angles  of  incident  (eg.  end-on,  broad-side,  oblique-incidence)  and 
for  several  different  polarizations  of  the  incident  field  relative  to  the  axis  of  the  cylinder  (eg.  horizontal, 
vertical,  and  skewed).  Each  tests  was  performed  in  the  near  and  far  fields  of  the  antenna.  IR 
thermograms  of  the  scattered  fields  were  taken.  The  diffraction  patterns  of  the  electric  field  scattered 
from  the  cylinder  are  clearly  indicated  in  these  thermograms. 

The  equi-temperature  contour  levels  in  the  IR  thermograms  are  being  compared  to  numerical 
predictions  of  the  scattered  electric  field  intensity.  The  numerical  predictions  are  being  performed  by 
Dr.  Zuffada  at  Caltech's  Jet  Propulsion  Laboratory  (JPL).  These  results  will  be  used  to  verify  JPL's 
numerical  code  which  combines  the  Method  of  Moments  (MoM)  with  a  Finite  Element  Method  (FEM) 
to  determine  electric  field  intensity  levels. 

3.2  Scattering  from  an  FI  6  Scale  Model  Aircraft 

The  IR  imaging  technique  was  also  used  to  map  the  scattered  electric  field  around  a  simple 
scale  model  of  an  aircraft.  A  plastic  FI  6  scale  model  was  constructed  and  sprayed  with  several  coats 
of  silver  paint  to  made  it  conductive.  IR  thermograms  of  the  magnitude  of  the  scattered  electric  field 
intensity  were  taken  in  the  horizontal  and  vertical  longitudinal  planes  through  the  fuselage  of  the 
aircraft.  The  diffraction  patterns  of  the  electric  field  scattered  from  the  aircraft  and  the  scattering 
centers  are  clearly  indicated  in  these  thermograms.  Standing  waves  are  setup  between  the  incident 
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wave  and  the  reflected  wave  off  the  aircraft.  Surface  waves  are  established  along  the  side  of  the 
fuselage  and  the  front  edge  of  the  wings  between  the  nose  of  the  aircraft  and  the  missile  rails. 
Diffraction  occurs  off  the  wing  tips.  A  shadow  zone  appears  behind  the  aircraft. 

The  equi-temperature  contour  levels  in  the  IR  thermograms  are  being  compared  to  numerical 
predictions  of  the  scattered  electric  field  intensity.  The  numerical  predictions  are  being  performed  with 
the  GEMACS  code  at  Rome  Laboratory.  These  results  will  be  used  to  validate  the  accuracy  of  the 
GEMACS  numerical  code. 

4.  Calibration 

The  IR  measurement  technique  was  calibrated  this  summer  at  Rome  Laboratory. 

4.1  Relative  measurements 

In  the  IR  measurements  described  above,  only  the  relative  magnitude  of  the  EM  fields  have 
been  determined.  The  relative  accuracies  of  the  field  intensities  measured  by  the  IR  thermograms 
taken  with  the  IR  scanner  were  determined  using  numerical  computer  codes  to  predict  the  normalized 
field  intensities  for  the  same  geometries.  The  relative  accuracies  were  usually  no  greater  than  10% 
in  error.  These  errors  occurred  at  the  minimum  (null)  field  intensity  positions  in  the  thermograms, 
where  the  effect  of  thermal  conduction  (bleeding)  from  adjacent  hot  spots  tends  to  fill  in  the  nulls 
somewhat.  Overall,  at  positions  away  from  nulls,  the  technique  was  closer  to  only  1%  in  error. 
Therefore,  the  technique  has  the  potential  to  produce  extremely  accurate  field  intensity  measurements, 
much  less  than  ±  1  dB  in  error. 

4.2  Absolute  Measurements 

To  determine  the  absolute  electric  or  magnetic  field  strengths  from  IR  thermograms,  the  IR 
scanning  detectors  and  detector  screens  must  be  calibrated. 

Appropriate  theoretical  models  and  experimental  practices  are  being  developed  to  permit  the 
absolute  determination  of  electric  and  magnetic  field  intensities  from  thermographic  images  (IR 
thermograms)  of  microwave  fields. 

4.2.1  Theoretical  calibration 

A  thermal/electromagnetic  model  is  being  developed  for  the  interaction  of  microwaves  with  a 
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lossy/complex  absorber  material.  The  theory  is  based  on  the  complex  form  of  Poynting's  Theorem  for 
an  absorbing  material  with  complex  constitutive  parameters.  The  broad-band  frequency  dependence 
of  the  complex  constitutive  parameters  of  the  IR  detection  screen  material  must  be  determined  and 
used  in  the  model  of  the  detector  screen  along  with  its  thermal  properties. 


4.2.2  Experimental  calibration 

The  intensity  levels  (equi-color  levels)  of  the  IR  thermograms  can  be  empirically  calibrated  using 
standard  gain  horn  antennas  at  several  frequencies,  angles  of  incidence,  and  polarizations  in  the  near 
and  far  fields  of  the  antenna.  The  predicted  field  level  is  simply  associated  with  the  resulting  color 
level. 

5.  Results 

The  electric/magnetic  field  distributions  can  be  predicted  for  a  well-known,  theoretically 
tractable  problem  geometry,  e.g.  scattering  from  a  linear  wire  antenna,  scattering  from  a  thin 
rectangular  slot  aperture,  antenna  radiation  in  the  near  or  far  field  of  a  rectangular  horn  antenna, 
induced  cavity  modes  in  a  rectangular  or  cylindrical  waveguide,  etc.  The  scattering  predictions  can 
be  made  for  a  normally  or  obliquely  incident  plane  wave  as  a  function  of  frequency,  angle  of  incidence, 
and  horizonal/vertical  polarization.  These  theoretical  predictions  can  be  verified  experimentally  to 
validate  the  IR/EM  interaction  model  for  each  problem  geometry. 

The  results  of  the  initial  calibration  test  for  electric  field  measurements  are  presented  for  a  lossy 
Kapton  detector  screen  developed  to  measure  the  absolute  magnitude  of  the  electric  field  in  the  plane 
of  the  detector  screen. 


5.1  Experimental  setup 

As  the  first  step  in  calibrating  the  detector  material,  an  experiment  was  performed  on  a  planar 
sheet  of  the  detector  material.  In  the  experiment,  the  screen  was  positioned  in  front  of  a  standard  gain 
horn  antenna  and  oriented  in  several  different  planes  making  different  angles  relative  to  the  antenna. 
For  one  series  of  IR  thermograms  the  screen  was  positioned  in  the  axial  plane  in  the  near  and  far  fields 
of  the  horn.  For  this  case,  the  normal  to  the  screen  was  oriented  perpendicular  to  the  direction  of  the 
incident  radiation  (perpendicular  incidence).  The  camera  was  placed  in  front  of  the  screen  looking  at 
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the  thermal  pattern  on  the  screen.  In  other  tests,  the  normal  to  the  screen  was  rotated  to  different 
angles  relative  to  the  bore-sight  of  the  horn.  The  frequency,  angle  of  incidence,  and  polarization  of  the 
incident  wave  were  varied. 

5.2  Experimental  data 

The  induced  temperature  distribution  in  the  plane  of  the  detector  screen  was  mapped  with  the 
IR  camera.  The  measured  temperature  on  the  bore-sight  of  the  horn  antenna  was  compared  to  the 
value  of  the  incident  field  calculated  using  the  Friis  Transmitter/Receiver  Formula  with  the  known 
and/or  measured  characteristics  of  the  antenna,  corrected  for  all  losses  and  mismatches.  Forward  and 
reverse  power  were  measured  with  calibrated  power  meters.  The  distance  to  the  screen  from  the 
phase  center  of  the  horn  and  the  angle  of  the  screen  relative  to  the  bore-sight  of  the  horn  antenna 
were  measured  for  each  configuration. 

Experimentally  obtained  temperatures  from  the  IR  thermograms  were  plotted  against  the 
incident  power  densities  for  each  case.  A  series  of  thermograms  were  taken  in  the  1  to  2  GHz  range 
of  frequencies  at  distances  from  1  to  2  meters  from  the  antenna.  The  background  ambient 
temperatures  for  each  case  were  also  recorded.  Many  other  configurations  were  tested  at  different 
distances  and  at  various  other  frequencies,  polarizations,  and  angles  of  incidence. 

On  the  average,  the  corrected  temperature  differences  between  the  various  cases  was  less 
than  ±  0.5  °F.  This  temperature  error  translates  into  a  ±  1  dB  field  measurement  error. 

6.  Conclusions 

Initial  empirical  results  from  anechoic  chamber  tests  indicate  that  the  induced  temperature 
distribution  across  the  IR  thermogram  in  a  properly  designed,  lossy,  planar  IR  detection  screen  placed 
in  the  area  over  which  an  EM  field  is  to  be  mapped  can  be  calibrated  to  measure  the  incident  EM  wave 
using  the  Joule  heating  of  the  screen  material. 

The  IR  measurement  technique  is,  therefore,  a  viable  method  to  aid  in  the  determination  of  EM 
fields  in  various  test  situations.  The  IR  method  allows  for  rapid  observation  of  EM  field  activity  and 
interference,  resulting  in  an  in-depth  understanding  of  EM  scattering  phenomena.  An  experimental 
technique  is  being  used  to  measure  and  calibrate  the  absolute  magnitude  of  the  field  intensity.  A 
theoretical  approach  to  this  calibration  problem  is  also  being  undertaken. 
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7. 


Future  Work 


The  development  of  a  suitable  magnetic  detection  screen  is  in  progress.  The  use  of  spin-on 
magnetic  thin  films  is  being  investigated. 

It  is  also  possible  to  estimate  the  phase  of  the  incident  wave  by  making  a  numerical  model  of 
the  thermal/electromagnetic  interaction  in  the  detector  screen  material,  and  to  used  energy 
minimization  techniques  to  estimate  the  phase  of  the  incident  wave.  Microwave  holographic 
techniques  can  also  be  used  to  determine  the  phase  of  the  wave  incident  on  the  IR  detector  screen  if 
the  magnitude  of  the  incident  electric  field  is  measured  at  several  different  positions  in  the  near  field 
of  the  radiation  source.  This  phaseless  measurement  work  is  being  done  in  conjunction  with  Syracuse 
University  (Prof.  Tapan  Sakar)  and  the  National  Institute  of  Standards  and  Technology  (NIST)  (Dr.  Carl 
Stubenrauch). 

8.  Publications 

The  papers  published  on  this  project  during  the  AFOSR  Summer  Research  Program  are  listed 
in  the  Appendix. 
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FLEXIBLE  ADJUSTMEMT  OF  DATA 


Michael  A.  Pittarelli 
Associate  Professor 

Department  of  Computer  and  Information  Science 
SUIMY  Institute  of  Technology  at  Utica/Rome 

Abstract 

Databases  may  exhibit  many  forms  of  incompleteness.  This  report  explores  methods 
for  overcoming  incompleteness  in  the  form  of  missing  tuples.  Specifically,  algorithms  are 
investigated  for  replacing  a  relation  that  is  known  to  be  incomplete  with  a  superset.  Of 
particular  interest  are  algorithms  that  make  available  at  any  time  an  approximation  and  with 
the  property  that  the  approximate  solution  improves  monotonically  with  computing  time. 
How  to  measure  the  quality  of  an  approximation  depends  on  the  use  to  which  a  relation  is  to 
be  put.  Although  this  report  concentrates  on  rule  matching  in  rule-based  systems,  application 
of  these  and  similar  methods  to  problems  of  knowledge  discovery  in  incomplete  databases  is 
also  discussed. 
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FLEXIBLE  ADJUSTMENT  OF  DATA 

Michael  A.  Pittarelli 


The  squirming  facts  exceed  the  squamous  mind, 
If  one  may  say  so.  And  yet  relation  appears, 

A  small  relation  expanding  like  the  shade 
Of  a  cloud  on  sand,  a  shape  on  the  side  of  a  hill. 
[8.  p.  215] 


Database  instances  may  exhibit  many  forms  of  incompleteness.  The  set  of  attributes 
may  be  too  small  to  permit  adequate  characterization  of  the  entities  under  consideration. 
Similarly,  the  set  of  values  for  one  or  more  attributes  may  not  be  adequate.  It  may  be 
necessary  to  replace  {OK.  Not  OK}  with  {Too_Low.  Acceptable,  TooHigh},  for  example. 
Some  attribute  values  may  be  unknown  for  or  inapplicable  to  some  entities.  Various 
extensions  of  the  relational  model  have  been  proposed  to  deal  with  this  last  type  of 
incompleteness  [3.4].  We  consider  databases  some  tuples  of  which  are  believed  to  be 
missing  in  their  entirety.  The  tuples  may  be  absent  due  to  a  problem  in  transmitting  a  copy 
of  a  relation.  They  may  be  missing  because  data  gathering  has  been  terminated  prematurely. 

In  particular,  we  explore  methods  for  replacing  with  some  superset  a  relation  instance 
that  is  believed  to  be  missing  one  or  more  whole  tuples.  How  exactly  to  proceed  depends  on 
the  use  to  which  the  data  are  to  be  put. 

In  all  cases,  a  relation  instance  r  is  replaced  with  an  f  such  that  rC.r'Cdom(s(r)), 
where:  s(r)  is  the  scheme  of  r,  the  set  of  attributes  on  which  r  is  defined;  for  any  attribute 
Aes(r),  dom(A)  is  the  set  of  possible  values  for  A  for  a  set  W  of  attributes  (and  a  fixed 
but  arbitrary  linear  ordering  on  the  elements  of  W), 


dom(W)  = 


X  dom(A). 
Aew 


A  database  scheme  (or  structure )  is  a  set  of  relation  schemes,  i.e.,  a  set  of  sets  of 


attributes.  The  scheme  for  a  relational  database  D  =  { r i . rn }  is 


s{D)  =  {s(rj) . s(r„)}. 
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The  refinement  relation  on  database  schemes  will  be  important  in  what  follows.  A 
scheme  X  is  a  refinement  of  scheme  Y  (and  Y  is  a  coarsening  of  X ),  denoted  X<  Y,  iff  for 
every  VxeX  there  exists  a  Vye  Y  such  that  VXC  Vy.  X  is  an  immediate  refinement  of  Y  iff 
X<  Y,  X+  Y,  and  there  is  no  Z=hX  such  that  X<Z  and  Z<  Y. 


Example:  {{ A,B},{C }}  is  an  immediate  refinement  of  {{A.  B) ,{C,  D}}\  {{A}.{i3}}  is  a 
refinement,  but  not  an  immediate  refinement,  of  {{A,B},{C.D}}. 

Suppose  tuple  t£dom(M/)  and  that  VCW.  The  restriction  of  t  to  V  is  the  tuple 
t[l/]edom(l/)  such  that  t  agrees  with  t [ V/]  on  all  attributes  in  V.  The  projection  of  a 
relation  ronto  l/Cs(r)  is  then 

*v{r)  =  {t[V]\t£r}. 


Example:  The  projection  of 


onto  {A.C}  is  the  relation 


r  (A  B  C) 

31  ^1  c2 

al  ^1  c3 

d\  ^*2 

c?2 

a3  ^3  c3 


7r(AC}(r)  (A  C) 

31  c2 

31  C3 

a2  Cl 

a3  c3 


The  projection  of  a  relation  onto  a  database  scheme  is  a  database,  the  set  of  projections 
of  the  relation  onto  the  elements  of  the  scheme: 

*x(r)  =  {nv{r)  |  VeX}. 

An  operation  complementary  to  projection  is  extension.  The  extension  of  a  relation  r  to 
lOs(r)  is 


Ev(r)  =  {r'edom(y)  |7Ts(r)(r')  =  r}. 

Thus,  the  extension  of  a  relation  is  a  set  of  relations:  i.e.,  a  set  of  sets  of  tuples.  The  ele¬ 
ments  of  Ev(r)  are  partially  ordered  by  the  subset  relation.  The  unique  maximal  element  is 
the  cylindrical  extension  of  r,  denoted  Cv{r)\ 

For  all  r'eEv{r).  r'CCv{r)eEv{r). 

Alternatively,  with  the  usual  relational  algebra  collapse  of 

<<a\ . an>,<b\ . bk» 
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to 


^  t  .  .  .  , b^ ,  f  bf(  ^ 


and  reordering, 

Cv{r)  =  rXdom(V—s[r)). 

The  natural  join  (or  join )  of  a  database  D  =  { r i . rn},  denoted  -/(0),  is  the  inter¬ 

section  of  the  cylindrical  extensions  of  its  elements  to  the  union  of  their  relation  schemes.  Let 

V  =  Ur6Ds(r). 

Then 

J(D)  =  nreDCv(r). 

Theorem  1:  X<  T<{s(r)}  implies  J{-KY[r))  C  J(nx{r)). 

Proof:  Suppose  that  X</<{s(r)}.  For  each  VxEX,  there  exists  a  VyE  Y  such  that  for 
every  t€ ^vx{r)  there  exists  a  tG7Ti^(r)  such  that  t  —  t'Jl/*].  Thus,  for  each  Vx€EX,  there 
exists  a  VyE  Y  such  that 

cs(,1KM)  c  c'W^^w). 

Therefore, 

cn^KW): 

i.e.,  -/(TTy'(r))  C  J(irx[r)).  □ 

Corollary  2:  Let  A"<{s (/•)}.  Then  r  C  J(nx[r)).  □ 

Consider  production  rules  of  the  form 

IF  < database  tuple >  THEN  <action> 

Suppose  that  we  have  available  to  us  (as  a  knowledge  base)  the  relational  database  instance 
r,  above,  and  that  the  tuples  326303  and  3363C1  should  also  be  present,  but  are  missing. 
Suppose  our  rulebase  consists  of: 

Rl:  IF  336303  THEN  A1 
R2:  IF  336302  THEN  A2 
R3:  IF  3361c1  THEN  A3 

None  of  the  condition  tuples  appears  in  the  relation  r.  Suppose  that  additional  tuples  are  not 
forthcoming  and  that  we  must  execute  one  of  the  three  actions.  How.  in  the  absence  of 
additional  information,  can  we  decide  which  one? 

We  could  select  a  rule  to  fire  at  random.  A  more  principled  approach  to  rule  selection 
utilizes  Theorem  1  and  Corollary  2:  Find  a  refinement  of  {s(/")}  onto  which  to  project  r  and 
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join.  The  resulting  relation  will  be  a  superset  of  r  and  may  contain  one  of  the  condition 
tuples.  There  remains  the  problem  of  identifying  a  suitable  database  scheme. 

The  schemes  to  be  considered  will  be  limited  to  those  that  are  irredundant  covers  of 
s(r).  X  =  {Vi . Vn}  is  an  irredundant  cover  of  a  set  of  attributes  l/iff 

(!)  uVi&x  =  v 

(2)  i*HV£Vj. 

Under  the  refinement  relation  on  database  schemes,  the  set  of  irredundant  covers  of  any  s(r) 
is  a  lattice  with  maximum  element  {s(r)}  and  minimum  element  {{i/}|i/^s(r)}.  A  search 
for  a  suitable  scheme  may  be  conducted  as  a  traversal  of  the  lattice:  start  with  {s (/•)}, 
replace  a  scheme  X  with  an  immediate  refinement  /.  and  terminate  when  projecting  onto  the 
current  candidate  scheme  and  joining  yields  one  or  more  of  the  condition  tuples  (or  when  Y 
=  {{y}|y£s (/")})■  Progress  toward  termination  is  made  at  each  step,  by  Theorem  1  and  the 
fact  that  {{  v}  |  v es(r)}  is  the  minimum  element  of  the  lattice. 

Unfortunately,  the  lattice  of  schemes  is  not  a  chain.  Only  |s(/-)|+l  of  the  elements  have 
a  single  immediate  refinement.  (When  |s(r)|  is  4,  for  example,  there  are  114  irredundant 
covers.)  As  an  alternative  to  breadth-first  search,  consider  the  algorithm  below,  where  T 
denotes  the  set  of  condition  tuples  and  d(r,  r')  is  the  Hamming  distance  between  the 
characteristic  functions  of  r  and  r'.  (Note  that  d(r,  r)  is  just  the  cardinality  of  the  sym¬ 
metric  difference  of  r  and  r'\  therefore,  the  domains  of  the  attributes  needn’t  be  known  to 
calculate  d.  and  tuples  that  appear  in  neither  r  nor  r'  can  be  ignored.) 

Algorithm  1. 

Input:  r  and  T.  Output:  Z. 

Z:={s(r)}-. 

while  J[Txz{r))  n  T  =  0  and  Z  =£  {{v}| vGs(r)}  do 
begin 

Ref.=  {  Y  |  Y  is  an  immediate  refinement  of  Z}: 

Ref'  := Ref. 
match—  false; 
while  /?e/’¥=0  do 
begin 

X:=  select  (Refj; 
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Ref.=  Ref-(X}: 
if  TnJ(nx(r))  =fc  0  then 
begin 

if  match  =  false 
then 
begin 

^ least 

match  :=  true 
end 
else 

if  |  rrUfeW)  |  <  I  TnJ(xZhM(r))  | 
then  2te,  :=  X 
end 
end 

if  match 
then  Z:=  Z/east 

else  Z:=  arg  min XeRef  d{r.  J{i r*(r))) 
end 

The  algorithm  will  output  a  structure  Z  such  that  exactly  one  of  the  following  conditions  is 
met: 

(a)  Z  =  {{ i/}|  (/")}  and  T  n  J[nz(r))  =  0 

(b) 

T  n  J[nzir))  *  0: 

for  all  proper  aggregates  X  of  Z  visited. 

T  n  J{irx{r))  =  0; 

and,  for  all  siblings  Y  of  Z, 

\T  n  J[nz{r))  |  <  \  TnJ(*Y{r))\. 

Let  c  =  |  T  n  J(nz{r))  |  at  the  termination  of  the  algorithm.  If  c  =  1.  exactly  one  rule 
can  be  fired.  If  c  >  1,  then  some  way  must  be  found  to  resolve  the  conflict  among  the  rules 
that  now  can  fire.  If  c  =  0.  more  data  must  be  obtained.  In  each  of  the  latter  two  cases, 
there  is  nothing  more  that  relational  algebra  can  contribute. 

The  rationale  for  the  use  of  a  distance  measure  is  twofold.  First,  it  cuts  down  the  size 
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of  the  search  space.  But  the  same  reduction  could  be  achieved  by  selecting  an  element  of 
Ref'  at  random.  Instead,  the  frontier  element  the  projection  onto  which  yields  a  relation 
closest  on  a  reasonable  metric  to  the  original  r  is  picked  to  refine  further.  We  want  to  pro¬ 
duce  tuples  that  are  not  elements  of  the  original  relation  r,  but  we  don't  want  them  produced 
at  random.  The  assumption  is  that  some  tuples  are  missing  and  that  they  can  be  recovered 
by  processing  r  appropriately.  It  is  assumed  that  r  is  an  approximation  to  a  relation  r*  and 
that  r*  contains  one  or  more  condition  tuples.  For  any  scheme  X.  J( ^x{r))  ls  a  candidate  for 
the  unknown  r*.  By  Theorem  1.  for  any  immediate  refinement  Y  of  X.  J{-Ky{r))  is  more 
likely  to  contain  condition  tuples  than  is  _/( TTx[r)).  However,  the  immediate  refinements  of  X 
differ  with  respect  to  the  quality  of  the  approximation  to  r  they  ofFer.  As  a  search  heuristic,  it 
is  assumed  that  the  better  the  approximation  to  r.  the  better  the  approximation  to  the 
unknown  r*  (which  r  is  assumed  to  resemble). 

Definition:  r  is  reconstructable  from  X  iff 

r=J{irx{r)). 

r  is  approximately  reconstructable  from  X  to  degree  e  ifF 

d(r,  -/(**(/•)))  <  e. 

A  relation  scheme  satisfies  the  join  dependency  *[X]  ifF  each  of  its  instances  is  reconstruct¬ 
able  from  X  [9], 

Join  dependencies  are  implied  by  functional  dependencies  (but  not  conversely).  Func¬ 
tional  dependencies  are  more  easily  grasped  intuitively  than  are  join  dependencies.  An 
instance  r  satisfies  a  functional  dependency  from  A  to  B  (sets  of  attributes),  denoted  A^B, 
ifF,  for  any  tuples  t  and  t'  in  r, 

t\A]  =  ?[A\  implies  t[B\  =  t'[B]. 

Theorem  3:  If  r  satisfies  A— >£.  then 

r  =  ^{4ue.4u(s(/-)-e)}(''))- 

Proof: 

(i)  By  Corollary  2, 

r  Q  J{n{A\JB,Au(s(r)-B)}{r))- 

(ii)  Suppose  t  e  J{n{AuB.Au(s(r)-B ]}{'))■  Then 

t  e  CsW(7T„u£j(r))  and  t  <E  C* ^ (w^ u(^(rj — (#’))- 
Then  there  exist  f  and  t"  in  r  such  that 

t'[AuB\  =  t[AuB] 
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and 

f"[Au(s(r)-£)]  =  f[4u(s  (r)-£)]. 

Then,  since  A—>B, 

f[B\  =  t"[B]  =  ([6]- 

Therefore,  t  =  t"  €  r.  So, 

■J [^{AuB.AuisiA-B)} (r))  Q  r.O 

Any  set  of  database  schemes  (e.g..  the  set  of  immediate  refinements  of  a  scheme  X) 
partitions  a  set  of  relation  instances  into  equivalence  classes.  The  equivalence  class  associ¬ 
ated  with  scheme  Z  is  the  set  of  relations  for  which  Z  is  the  best  approximation  (among  all 
of  the  schemes  under  consideration,  with  ties  broken  arbitrarily).  Scheme  Z  is  a  better 
approximation  of  r  than  is  Y  ifF 

d(r.J(nz{r)))  <  d(r. J(7rK(r))). 

A  distance  measure  d  is  monotonic  with  respect  to  refinement  ifF  Y<X  implies  that  X  is  a 
better  approximation  (with  respect  to  d),  for  any  r.  than  is  /.  The  Hamming  distance 
between  (characteristic  functions  of)  relations  is  monotonic. 

Theorem  4:  Y  <  Ximplies  d(r,J( ^x{r)))  ^  d(r,J(TrY(r))). 

Proof:  If  d  is  Hamming  distance  between  characteristic  functions,  then,  for  any  relations  r 
and  r', 

d(r.r')  =  d(r',r)  =  |r©r'|. 
where  ©  denotes  symmetric  difference: 

r®r‘  =  (r-r)  U  (r'-r). 

Suppose  Y <X.  Then 

d(r,J(nx(r)))  =  |f ©7(,rx(r))  | 

=  |  {r-J{nx{r)))  u  (^(*x(/'))_r)  I 

=  \J{*x{r))~r\ 

<  \J{wY{r))-r\ 

=  d  (r,  J  (n  Y(r))) . 

since,  by  Corollary  2, 

r  C  J{nz[r)): 

and,  by  Theorem  1, 
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J{nx{r))  Q  J{*Y{r)).  □ 

So,  with  respect  to  Hamming  distance.  X  is  as  good  an  approximation  of  r  as  are  any  of  its 
immediate  refinements.  Thus,  if  r  is  reconstructable  from  Y  and  Y  is  an  immediate 
refinement  of  X.  then  r  is  reconstructable  from  X  also.  And,  if  r  is  approximately  recon¬ 
structable  from  /  to  degree  e,  it  is  also  approximately  reconstructable  from  X  to  degree  e,  if 
Y  <  X. 

Again,  let  T  denote  the  set  of  condition  tuples.  We  want  to  follow  a  path  in  the  lattice 
of  schemes  from  the  root,  (s(r)},  to  a  structure  X  such  that: 

(1)  r  *  J{nx{r)) 

(2)  d(r,  J{irx{r)))  <  e 

(3)  Tf1%(r))#0. 

(4)  For  all  Y  ^  X on  the  path  from  the  root  to  X,  T  n  J{'Ky{''))  =  0- 

There  is  no  guarantee  that  any  of  these  conditions  except  (2)  can  be  satisfied  (by  making  e 
large  enough).  A  fortiori,  there  is  no  guarantee  that  all  four  can  be  met  simultaneously.  Typi¬ 
cally.  however,  they  will  be  satisfiable. 

The  given  relation  r  does  not  equal  the  unknown  relation  t*.  However,  the  assumption  is 
that  r  resembles  r*  closely  enough  so  that,  relative  to  a  fixed  set  of  structures  (e.g.,  all  those 
at  the  same  path  length  from  the  root  of  the  lattice  of  structures),  r  and  r*  are  in  the  same 
equivalence  class:  the  best  structure  for  r*  is  also  the  best  structure  for  r.  Let  X  denote  this 
structure.  Ideally. 

(a)  r  *  J(nx{r)). 

(b)  r*  =  J{xx{r*)). 

and 

(c)  for  all  VtiX.  t xv{r)  =  7r„(r*). 

If  these  conditions  hold,  then 

r*  =  J{nx{r)). 

Conditions  (a),  (b)  and  (c)  imply  that  the  projections  of  r  are  better  approximations  of 
the  projections  of  r*  than  r  is  of  r*.  If  in  fact  r  C  r*.  then  this  is  guaranteed  to  be  the  case: 

Theorem  5:  If  r  C  r*.  then,  for  any  V  C  s(r*)  =  s(r).  d(r,  r*)  >  d(7rv(r),  'KV(r*)). 

Proof:  Suppose  r  C  r*.  Then  n\/{r)  C  n v{r*).  Then 

d(r.  r*)  =  |  r*—r  | 
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and 


d{*v{r).  ir i /{r*))  =  \*Ar*)~*v{r)\- 

Suppose  ti  G  7r^/(r*)  — -  Then  there  exists  a  t2  G  r*—r  such  that  t*  =  ^2  [  ^1  ■  Suppose 
f3  G  nv{r*)—  nv{r)  and  t3  ¥=  tj.  Then  there  exists  a  f4  G  r* — r  such  that  t4[l/]  =  t3  and 

t4  =£  t2-  So, 

I  r*—r  |  >  |  ?r „(/•*) -?r „(r)  |. 

which  implies 

d(r.  r*)  >  d(Trv(r).  7Ty(r*)).  □ 

If  condition  (b)  holds,  and  X  =£  (s(/-*)}.  then  r*  embodies  the  sort  of  redundancy  that 
causes  the  anomalies  that  normalization  by  decomposition  is  intended  to  eliminate.  Of 
course,  in  the  database  design  context,  functional  (and  other)  dependencies  implying  decom- 
posability  (i.e.,  join  dependencies)  are  inferred  from  the  "meanings"  of  the  attributes,  and 
hold  for  any  instance  over  the  given  scheme.  Here,  we  are  testing  directly  for  join  dependen¬ 
cies  (or  approximate  join  dependencies)  in  a  single  relation  instance. 

Redundancy  in  r*  coupled  with  the  greater  fidelity  of  tt v[r)  to  nv(r*)  would  explain  the 
success  of  this  technique.  Further,  there  is  good  reason  to  expect  decomposability  (redun¬ 
dancy)  in  the  systems  that  we  actually  encounter  [7]. 

Let  us  demonstrate  the  search  algorithm  using  the  relation  r  and  the  three  rules  above. 
Recall  that  none  of  the  three  rule  antecedents  is  contained  in  r.  s  (r)  =  {{>4.  B,  C}}  and  has 
one  immediate  refinement,  namely  {{A  B}.{A,  C},{B,  C}}.  The  projection  of  r  onto  this 
structure  is  the  database: 


(4 

B)  *{A.c)[r) 

(A 

0 

*fae>M  ( B 

0 

ai 

bi 

ai 

c2 

bx 

c2 

a\ 

ai 

c3 

bi 

C3 

a2 

bz 

a2 

Cl 

b2 

C2 

az 

bz 

az 

c3 

bz 

C2 

bz 

C3 

/■equals  the  join  of  these  projections.  The  structure  {{A,B},{A,C}.{B,C}}  has  three  immedi¬ 
ate  refinements:  {{A  B}.{A,  C}}.  {{A  C}.{B,  C}},  and  {{B.  C},{A,  B}}.  The  join  of  the  pro¬ 
jections  onto  {{A  B},{A.  C}}  adds  a  tuple,  but  it  is  not  an  antecedent.  The  structure 
{{B,C},{A,B}}  adds  two  tuples,  neither  of  which  is  an  antecedent.  However,  one  of  the  two 
extra  tuples  contained  in  the  join  of  the  projection  of  r  onto  {{>4,  C},{B,  C}}  is  an  antecedent: 
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•7(fl~ffAC),ffl.c)}(/'))  (A  B  c) 

d\  b\  C2 

31  t>i  C3 

d\  C2 

a2  ^3  C1 

a3  ^3  c3 

32  63  C3 

33  63  Cj 

The  search  terminates,  and  rule  R1  fires. 

Each  of  the  approximations  of  r  generated  using  the  method  just  discussed  is  a  superset 
of  r.  However,  there  are  more  supersets  /Or,  r'£dom(s(r)),  than  there  are  refinements  of 
{s(r)}.  In  the  example  above,  suppose  that 

dom(A)  =  {ai.a2.a3}, 
dom(B)  =  {61.62./J3}. 
dom(C)  =  {ci.c2.c3}. 

Since  |r|  =  5  and  \dom(s(r))\  =  27.  r  has  222— 1  (proper)  supersets.  {s(r)},  on  the  other 
hand,  has  only  8  (proper)  refinements.  (The  ratio  of  the  number  of  refinements  to  the 
number  of  supersets  increases  with  |s(r)|.  Suppose  \dom(s[r))\  =  4,  \dom[Aj)\  =  2,  and  |r| 
=  8.  Then  r  has  28— 1  =  127  proper  supersets  and  {s(r)}  has  113  proper  refinements.) 

These  superset  approximations  can  be  combined  in  various  ways.  The  intersection  of 
any  collection  of  supersets  of  r.  e.g. 

{•^(7rx(r))l^  is  at  Path  length  /rfrom  {s  (/-)}} 

is  also  a  superset  of  r.  Likewise,  the  union  of  the  J[TXx[r))  is  a  superset  of  r.  Furthermore, 
there  may  not  exist  a  Y  <  {s(r)}  such  that  J(7rY(r))  equals  this  union  or  the  intersection. 

Example:  Suppose  that  the  data  are  instead 

r*  [A  B  q 

ai  ci 

32  ^2  c2 

a3  k  C3 

Suppose  \dom(Aj)\  =  3.  Then 

-/(^{{Ae}}.{c}}(^#))  =  7T{A0}(r#)  x  dom ( C) , 

-/(7r{{AC}}.{e}}('"#))  =  7r{AC}(r^)  X  dom{B).  ■ 

J{*{{B.c}}.{j 4}}(,'#))  =  7r{B.c}ir#)  x  dom(A). 

The  cardinality  of  each  of  these  relations  is  9.  However,  r #  is  contained  in  each  of  them. 
Hence,  the  cardinality  of  their  union  is  at  most  24.  Each  of  these  structures  has  the  same 
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immediate  refinement,  namely 


{{A}.  {B},  {C}}. 

the  universal  lower  bound.  The  join  of  the  projection  onto  this  structure  is 

J{*{{A}.{B}.{c}}{r#))  =  dom(A)  X  dom{B)  X  dom{C), 
the  cardinality  of  which  is  27.  So  there  is  no  structure  in  the  lattice  the  join  of  the  projection 
onto  which  equals  this  union. 

Note,  however,  that  if  none  of 

Jfaxx  (r)) . -l{%xk[r)) 

contains  an  element  of  T,  then  neither  will  their  intersection  or  union.  In  other  contexts 
(expanding  a  relation  for  purposes  of  detecting  patterns),  forming  these  unions  and  intersec¬ 
tions  may  be  useful. 

The  adjustment  method  discussed  is  flexible  in  that  the  number  of  refinement  steps 
taken  depends  on  the  contents  of  T,  the  set  of  condition  tuples.  Let  Str(r,  T)  denote  the 
structure  identified  by  Algorithm  1  for  relation  r  and  set  T.  Let  Seq[r.  T)  denote  the 
sequence  of  structures 

<{*(')} . Str{r,  T)> 

generated  by  Algorithm  1  with  rand  T as  input.  Then,  for  any  T and  T,  either  Seq(r,  T)  is 
a  subsequence  of  Seq(r,  T)  or  conversely. 

Algorithm  1  is  not.  however,  flexible  in  the  sense  introduced  by  Horvitz  [2].  The  term 
’flexible  algorithm’  is  currently  understood  to  mean  an  algorithm  that  provides  approximate 
solutions  to  difficult  problems  in  such  a  way  that,  minimally: 

(1)  an  approximate  solution  is  available  at  any  point  in  the  execution  of  the  algorithm: 

(2)  the  quality  of  the  approximate  solution  increases  with  an  increase  in  execution  time. 
Additional  properties  may  include  the  following: 

(3)  preemptibility: 

(4)  continuity  of  the  function  from  time  to  quality; 

(5)  diminishing  marginal  improvement  of  quality  with  an  increase  in  computing  time. 

(See  also  Boddy  and  Dean  [1],  in  which  the  term  ’anytime’  is  used  instead  of  ’flexible’  to 
describe  such  algorithms.) 

A  tuple  search  procedure  that  is  flexible  in  this  sense  begins  with  the  projection  of  r,  not 
onto  an  immediate  refinement  of  (s(r)},  but  instead  onto  {{v}|vGs (/■)}.  Then,  if 
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|rnJ(*jr(r))|  >  1. 

and  there  is  time  to  continue  the  search,  r  is  projected  onto  immediate  aggregates  of  the 
sibling(s)  Z  of  Xfor  which  d(r,  J(irz(r)))  is  minimized. 

Algorithm  2. 

Input:  r  and  T.  Output:  Z. 

if  Tnr  =  0 
then  begin 
Z:={{v}\ves(r)y. 
stick  :=  false: 

while  |  TnJ(Trz(r))  \  >  1  and  not  stick  do 
begin 

Agg.=  {  Y  |  Y  is  an  immediate  aggregate  of  Z}; 

X least  ■—  Z, 
done  :=  false: 

while  Agg  ¥=  0  and  not  done  do 
begin 

X  :=  select(Aggr): 

Agg  :=  Agg  -  {X}: 

if|rrU(^(r))|  =  l 

then  done  :=  true 

else  if  0  <  \  TnJ(irx{r))  \  <  |  TnJ{nXleast{r))  | 
then  Xkast  :=  X 

end: 
if  done 
then  Z  .=  X 

else  if  Z  =  Xieast  then  stick  :=  true 
else  Z  .  X/eaS( 
end 
end 

At  any  point  in  the  execution  of  Algorithm  2,  the  current  value  of  Z  is  the  structure 
identified  by  the  outer  loop  as  generating  the  fewest  condition  tuples.  By  Theorem  1,  succes¬ 
sive  values  of  Z  generate  progressively  fewer  condition  tuples.  The  inner  while  loop  of 


Algorithm  2  identifies  the  structure  among  the  current  value  of  Z  and  its  immediate 
refinements  that  generates  the  fewest  condition  tuples.  It  is  possible  for  the  current  Z  to  gen¬ 
erate  more  than  one  condition  tuple,  while  each  of  its  immediate  refinements  generates  zero. 
In  this  case,  the  variable  'stick'  is  set  to  true.  Ideally,  one  of  the  immediate  refinements  of 
the  current  Z  generates  exactly  one  tuple  (in  which  case  'done'  is  set  to  true).  In  the  worst 
case  (if  not  interrupted  and  the  outer  while  loop  is  entered),  the  algorithm  terminates  with  Z 
equal  to  the  single  immediate  refinement  of  {s  (/•)},  which  consists  of  each  of  the  |s(r)| 
( |  s(r)  |  —  l)-element  subsets  of  s(r). 

Similar  methods  may  be  used  in  other  situations.  We  may  wish  to  analyze  the  data  in  a 
relational  database  for  patterns.  For  example,  whenever  Humidity  >  88  and 
BaromPressure  <  28.5  and  Temp  >  75  (F).  Rain  =  True.  Knowledge  discovery  in  data¬ 
bases  is  the  term  currently  used  for  the  subfield  of  machine  learning  concerned  with  the 
identification  of  such  rules  and  patterns  in  databases  [5].  Can  a  relation  that  is  known  to  be 
incomplete  be  expanded  by  projection  and  join  to  yield  a  superset  that  exhibits  more  of  the 
patterns  in  the  real  world  entity  that  the  database  is  intended  to  represent? 

Suppose  we  decide  to  record  which  of  the  possible  combinations  of  SEX.  HEIGHT,  and 
WEIGHT  appear  among  the  members  of  some  population  (e.g.,  residents  of  Oneida  County, 
NY).  Suppose  we  discretize  HEIGHT  and  WEIGHT:  we  recognize  only  3  values  for  HEIGHT 


(viz.,  tall,  medium,  short)  and  3  for  WEIGHT  (heavy,  medium,  light),  there  are  262,144 
different  possible  relation  instances.  Suppose  the  actual  combinations  are: 

SEX 

HEIGHT 

WEIGHT 

male 

tall 

heavy 

male 

medium 

medium 

female 

tall 

heavy 

female 

medium 

medium 

We  give  someone  a  summer  grant  to 

measure  and  record  all  of  the  combinations  of  values 

that  appear  in  the  population.  By  the  end  of  the 

summer,  the  only  combinations  that  have 

been  observed  are: 

SEX 

HEIGHT 

WEIGHT 

male 

tall 

heavy 

female 

medium 

medium 

The  actual  relation  is  the  join  of  the  projections  of  the  observed  relation  onto  the  structure 

{{SEX}, {HEIGHT,  WEIGHT}}. 
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HEIGHT  WEIGHT 


SEX 

male  tall  heavy 

female  medium  medium 

A  search  for  an  appropriate  structure  can  proceed  either  by  refinement,  starting  with  the 
universal  upper  bound  in  the  lattice  of  structures,  or  by  aggregation,  starting  with  the  univer¬ 
sal  lower  bound.  Unlike  the  situation  with  rule-matching  (or  analogous  situations  in  flexible 
adjustment  of  probabilities,  in  the  context  of  a  particular  decision  problem  [6]).  we  need  a 
problem-independent  stopping  criterion.  For  example,  termination  may  occur  when  a  rela¬ 
tively  large  increase  in  distance  is  detected  in  going  from  one  level  of  the  lattice  to  the  next. 
(If  refining,  back  up  to  the  best  structure  at  the  previously  visited  level.) 

Under  what  conditions  do  patterns  appear  in  adjusted  data  that  are  not  apparent  in 
"raw"  relational  data?  Intuitively,  it  seems  that  the  adjusted  relation  needn’t  coincide  exactly 
with  the  "actual"  relation  in  order  for  interesting  patterns  in  the  adjusted  relation  to  coincide 
exactly  with  the  interesting  patterns  in  the  "actual"  relation.  Two  objects  (e.g..  relations) 
may  be  equivalent  with  respect  to  important  qualities  without  being  identical  in  every  respect. 
(For  example,  a  crash-test  dummy  is  equivalent  to  a  human  being  for  crash-testing  pur¬ 
poses.)  These  and  other  questions  (e.g.,  the  accuracy,  using  simulated  relational  data,  of  the 
stopping  criterion  suggested  above)  remain  for  future  research. 
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A  STUDY  OF  IMPEDANCE  MATCHING  OF  MICROSTRIP  PATCH  ANTENNA 
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Abstract 


The  concept  of  impedance  matching  of  an  infinite  phased-array  antenna  over  wide  scan  angles 
by  means  of  a  coupling  circuit  in  the  feed  network  was  originally  proposed  by  Hannan  et  aL 
To  the  best  of  our  knowledge  this  concept  was  not  implemented  in  practice.  We  investigated 
the  feasibility  of  implementing  this  technique  for  microstrip  patch  arrays.  Computed  values  of 
input  impedance  as  a  function  of  scan  angles  in  E-plane,  H-plane,  and  D-plane  (45° )  were 
used  as  input  to  an  optimization  program,  which  minimized  the  reflection  coefficient 
magnitude  over  a  range  of  scan  angles.  The  output  of  the  program  are  the  line  lengths  and 
coupling  susceptances  in  the  E-plane  and  in  H-plane.  We  designed  coupling  networks  for 
matching  three  microstrip  patch  array  geometries  over  wide  scan  angles.  Preliminary  studies 
show  that  it  is  possible  to  realize  such  coupling  networks  in  microstrip-type  transmission 
medium. 
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A  STUDY  OF  IMPEDANCE  MATCHING  OF  MICROSTRIP  PATCH  ANTENNA 
ARRAYS  OVER  WIDE  SCAN  ANGLES 


Sembiam  R.  Rengarajan 


Introduction 

An  excellent  review  of  different  methods  of  matching  phased-array  antennas  over  wide  scan 
angles  was  presented  by  Knittel  [1].  Among  other  methods,  the  design  of  a  coupling  circuit  in 
the  feed  network  [2,3]  has  the  advantage  that  the  array  geometry  and  the  element  may  be 
designed  for  optimum  radiation  pattern  performance  with  the  array  elements  matched  only  at 
broadside.  Hannan  et  al.  assumed  that  the  input  impedance  of  an  infinite  planar  array  of 
dipoles  as  a  function  of  scan  angles  in  the  E-plane,  H-plane,  and  D-plane  (45  )  were  the  same 
as  the  corresponding  values  measured  for  the  central  element  of  a  7x9  array  [3].  They 
obtained  theoretical  values  for  the  line  lengths  and  coupling  susceptance  in  the  E-plane  and 
that  in  the  H-plane,  in  order  to  match  the  infinite  array  of  dipoles.  To  the  best  of  our 
knowledge  this  concept  has  not  been  implemented  experimentally. 


Pozar  states  that  there  is  a  need  for  matching  phased  arrays  of  microstrip  patch  elements  over 
wide  scan  angles  [4],  Although  this  has  been  addressed  by  Herd  [5]  by  a  proper  design  of  the 
physical  parameters  of  the  element  and  the  unit  cell  size  for  an  infinite  array,  there  are  two 
reasons  for  investigating  the  coupling  circuit  concept  for  microstrip  arrays  in  this  work.  In 
microstrip  patch  arrays,  a  coupling  circuit  may  be  conveniently  implemented  in  the  same  layer 
as  that  of  the  patches  if  there  is  enough  real  estate.  Otherwise  the  coupling  network  may  be 
printed  in  another  layer  with  a  “tile”  architecture  for  the  array,  which  is  common  these  days.  A 
second  advantage  of  having  a  coupling  network  for  matching  an  array  over  wide  scan  angles 
is  that  the  element  parameters  and  the  unit  cell  size  may  be  optimized  for  radiation  pattern 
performance  with  the  elements  matched  only  at  broadside,  without  any  consideration  on  how 
the  scan  impedance  varies.  It  may  be  pointed  out  that  if  the  array  has  a  scan  blindness  angle, 
we  can  only  match  to  scan  angles  less  than  the  blindness  angle. 
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Methodology 

Three  patch  array  geometries  were  considered  for  this  study.  The  first  two  arrays  were  those 
studied  by  Pozar  and  their  scan  reflection  coefficients  are  presented  in  Figs.  3  and  8  in  [4] 
respectively  along  with  their  physical  parameters,  are  reproduced  in  Fig.  1.  Unfortunately  the 
phase  values  are  not  available  in  [4].  Therefore  the  scan  reflection  coefficient  magnitude  and 
phase  were  recomputed  using  the  computer  program  developed  by  Herd  for  an  infinite  array 
of  stacked  patch  antennas  [5].  The  third  array  shown  in  Fig.  2  has  a  stacked-patch  geometry 
and  it  consists  of  a  superconducting  layer  at  the  bottom  (e  =24,  h=0.0169  Xo )  with  a  square 

conducting  patch  of  side  0.096  XQ ,  with  an  air  gap  of  h=0.03336  XQ  above,  followed  by 
another  dielectric  layer  (e=3.8,  h=0.0667  XQ)  with  a  square  patch  of  side  0.2325  XQ  located  at 
the  lower  surface  of  the  top  dielectric  layer.  Note  that  £  is  the  relative  permittivity,  h  is  the 
thickness  of  the  dielectric  layer  and  Xq  is  the  free  space  wavelength.  An  inset  feed  centered  at 
one  edge  of  the  lower  patch  and  0.015  A,Qfrom  the  edge  is  used.  A  square  unit  cell  of  size  0.4 
XQ  is  assumed  for  the  infinite  array.  The  scan  reflection  coefficients  for  this  were  also 
computed  using  the  same  code  [5]. 

The  design  of  the  coupling  impedance  follows  along  the  lines  of  the  procedure  given  by 
Hannan  et  al  [3].  In  [3]  wide  angle  match  was  accomplished  by  considering  6  parameters  for 
an  infinite  array.  First  a  length  of  transmission  line  1^  is  introduced  behind  the  element  port 
where  the  scan  impedance  is  known.  Then  a  coupling  susceptance  be  connected  between 
adjacent  elements  in  the  E-plane  is  determined.  Subsequently  another  line  length  ^  and 
coupling  susceptance  bh  connected  between  adjacent  elements  in  the  H-plane  is  determined. 
Finally  a  third  line  length  lc  and  a  susceptance  b  are  connected  at  the  input  of  each  feed 
element  to  achieve  a  match.  We  found  that  it  is  not  necessary  to  have  the  last  two  parameters 
to  achieve  a  match  over  wide  scan  angles.  A  good  match  is  achievable  with  only  four 
parameters  1, ,  be  ,  ^  ,  and  bh  whereas  if  we  use  three  parameters  ls ,  be  ,  and  bfi  the  input 
reflection  coefficient  is  found  to  be  slightly  higher.  Fig.  3  shows  a  schematic  representation  of 
the  coupling  network  in  E-plane.  A  similar  arrangement  may  exist  in  the  H-plane.  The  design 
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of  the  coupling  network  is  treated  as  a  four  (or  three)  parameter  optimization  problem  where 
the  objective  is  to  minimize  the  cost  function.  The  cost  function  is  specified  as  the  sum  of  the 
squares  of  the  reflection  coefficient  magnitudes.  A  variation  of  the  definition  of  cost  function 
in  terms  of  fourth  powers  of  reflection  coefficient  magnitude  etc.  were  experimented  with  to 
put  a  greater  penalty  on  high  values  of  the  reflection  coefficient  magnitude.  The  minimization 
is  carried  out  in  different  directions  using  numerically  evaluated  gradients  until  the  solution 
converged. 

Table  l  shows  the  reflection  coefficient  magnitude  and  phase  (  F  )  for  different  scan  angles  in 
E,  H,  and  D  planes  for  the  first  array  (see  Fig.l).  Tables  2  and  3  show  the  four  parameters  of 
the  coupling  network  and  the  resulting  T  for  a  maximum  scan  angle  of  75°  and  70° 
respectively.  In  the  first  case  the  maximum  voltage  standing  wave  ratio  Smai  before  and  after 
introducing  the  matching  network  are  6.7  and  1.86  respectively.  When  we  consider  a 
maximum  scan  angle  of  70  the  corresponding  figures  are  4.7  and  1.58  respectively.  Table  4 
shows  similar  results  for  a  maximum  scan  angle  of  70  with  only  three  parameters  for  the 
coupling  circuit.  Here  is  1.78  after  introducing  the  matching  network,  a  value  slightly 

greater  than  the  corresponding  figure  in  Table  3. 

Table  5  shows  the  reflection  coefficient  magnitudes  and  phases  of  the  second  microstrip  array. 
We  note  that  there  is  a  good  match  in  the  H-plane.  It  is  sufficient  to  have  only  two  parameters 
for  the  coupling  circuit,  \ ,  and  b  .  Table  6  shows  these  parameters  and  T  in  the  E-plane  and 

D-plane  after  introducing  the  matching  network.  A  maximum  scan  angle  of  35  °  is  assumed  for 
matching.  The  values  of  before  and  after  matching  are  9  and  2.1  respectively.  The 

reflection  coefficient  magnitudes  in  the  H-plane  are  unaffected  by  this  matching  network. 

A  study  of  the  third  array  showed  that  if  the  maximum  scan  angle  is  70°,  before  and  after 
introducing  the  4-parameters  matching  network  are  6.1  and  1.8  respectively.  The  latter  figure 
becomes  3.2  if  we  have  only  three  parameters  in  the  coupling  circuit.  However,  if  the 
maximum  scan  angle  is  only  60  ,  S^t  before  and  after  introducing  the  4-parameter  matching 
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network  are  4.3  and  1.4  respectively.  The  latter  figure  becomes  2.05  if  we  have  only  3 
parameters  in  the  coupling  network. 


Realization  of  the  coupling  circuit 

Capacitive  and  inductive  susceptances  required  in  the  coupling  circuit  may  be  realized  in  a 
microstrip  transmission  medium  by  printed  structures  such  as  interdigital  capacitors  and  spiral 
inductors  in  rectangular  or  circular  configuration.  In  order  to  realize  a  series  inductor  there  is 
a  need  to  do  a  wire-bonding  between  one  of  the  inductor  terminals  and  a  microstrip  line. 
Computer  aided  designs  (CAD)  of  these  elements  are  possible  with  packages  such  as  [6]. 
Initial  studies  with  [6]  showed  that  equivalent  lumped  coupling  susceptance  values  obtained  in 
our  study  of  three  microstrip  arrays  may  be  realized.  Further  work  is  required  by  including  the 
complete  scattering  parameters  of  such  elements  and  the  connecting  transmission  lines.  We 
need  to  study  the  antenna  input  matching  by  the  required  coupling  circuit  as  well  as  the 
realization  of  coupling  elements  together  as  an  optimization  problem. 

Conclusion 

We  have  revisited  Hannan’s  approach  to  matching  a  phased  array  over  wide  scan  angles  by 
means  of  a  coupling  circuit  and  shown  that  it  is  suitable  for  infinite  arrays  of  microstrip  patch 
elements.  In  this  initial  study  we  assumed  that  the  coupling  circuit  consists  of  equivalent 
lumped  capacitive  or  inductive  suscepance.  Further  work  is  required  to  match  the  array  scan 
impedance  by  considering  the  two-port  scattering  matrix  parameters  of  the  coupling  circuit 
with  the  use  of  CAD  programs  for  such  elements.  It  would  be  desirable  to  implement  this  for 
a  linear  array  or  for  a  planar  array  such  as  the  second  array  studied  in  this  work.  Further  work 
on  the  design  of  coupling  circuits  for  elements  near  the  edge  of  a  large  array  where  the 
environment  is  quite  different  from  an  infinite  array  will  be  interesting. 
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. .  Geometry  of  an  infinite  array  of  microstrip  patches.  The  ground  plane 
is  in  the  z  =  0  plane  and  the  top  of  the  substrate  is  in  the  *  =  d  plane. 


(a)  Geometry 


6r<  2.55 
4  <0.02  X0 
•  *  b  »  0.5  Xe 
L  <  0.3  X0 
W  «  0.5  Xo 
Xp<  L./4 


-  E -PLANE 

- M- PLANE 

- O-PLANE 

Zb  =  97.T1 


Reflection  coefficient  magnitude  of  an  infinite  microstrip  array. 


€r  <  12.8 
4  •  0.06  Xe 
a  *  b«  0.8  X0 
L<  0.098  Xe 

W<  a»x0 

Xp»  L/2 


- E- PLANE 

- H- PLANE 

- 0- PLANE 


Zb  <  143.11 


Reflection  coefficient  magnitude  of  an  infinite  microstrip  array. 


(b)  Parameters  and  T  for  array  1  from  fig.  3  of  [4]  (c) )  Parameters  and  T  for  array  2  from  fig.  8  of  [ 


Fig.  1  Geometry  and  parameters  of  patch  arrays  1  and  2 
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Scan 

Angle 

(deg.) 

E-Piane 

Reflection  Coefficient 

magnitude  phase 

(deg.) 

D-Plane 

Reflection  Coefficient 

magnitude  phase 

(deg.) 

H-Plane 

Reflection  Coefficient 

magnitude  phase 

_  (deg.) 

0.0 

0.0 

0.0 

0.0 

0.0 

5.0 

-57.57 

0.003 

-49.33 

0.003 

10.0 

-58.27 

0.011 

-49.56 

0.013 

-43.92 

15.0 

-59.69 

0.024 

-50.12 

0.029 

-44.69 

20.0 

-61.76 

0.042 

-50.93 

0.051 

-45.77 

25.0 

0.056 

-64.52 

0.065 

-52.04 

0.081 

-47.16 

30.0 

0.079 

-68.05 

0.091 

-53.46 

0.117 

-48.88 

35.0 

0.105 

-72.42 

0.121 

-55.27 

0.160 

-50.92 

40.0 

0.134 

-77.75 

0.153 

-57.51 

0.210 

-53.29 

45.0 

0.166 

-84.15 

0.188 

-60.25 

0.268 

-56.00 

50.0 

0.201 

-91.76 

0.225 

-63.56 

0.332 

-59.05 

55.0 

0.239 

-100.70 

0.266 

-67.46 

0.403 

-62.43 

60.0 

0.282 

-111.07 

0.311 

-71.89 

0.480 

-66.13 

65.0 

0.334 

-122.89 

0.366 

-76.70 

0.562 

-70.13 

70.0 

0.401 

-136.02 

0.435 

-81.54 

0.649 

-74.40 

75.0 

0.500 

-150.09 

0.526 

-85.92 

0.739 

-78.90 

Table  1  Reflection  coefficient  as  a  function  of  scan  angle  for  the  microstrip  patch  array  1 
(before  matching)  . 
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Scan  E-Plane  D-Plane  H-Plane 

Angle  Reflection  Coefficient  Reflection  Coefficient  Reflection  Coefficient 
(deg.) 


magnitude 

phase 

(deg.) 

magnitude 

phase 

(deg.) 

magnitude 

phase 
.....(.deg,l . 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

5.0 

0.007 

-41.05 

0.003 

42.80 

0.010 

90.95 

10.0 

0.026 

-44.71 

0.013 

46.37 

0.037 

92.50 

15.0 

-47.24 

0.030 

48.73 

0.077 

94.81 

20.0 

-51.15 

0.051 

51.13 

0.126 

97.57 

25.0 

0.128 

-55.47 

0.075 

53.67 

0.174 

MSm 

30.0 

0.163 

-60.00 

56.91 

0.217 

w!m 

35.0 

0.191 

-64.92 

0.124 

0.248 

40.0 

-70.37 

0.145 

63.49 

45.0 

0.216 

-76.60 

0.161 

66.86 

mSm 

50.0 

0.209 

-84.08 

0.168 

WM 

55.0 

0.186 

-93.76 

0.164 

74.74 

94.36 

60.0 

0.146 

0.147 

79.81 

65.0 

-143.18 

0.113 

92.41 

70.0 

144.95 

128.99 

75.0 

0.249 

-158.96 

-69.28 

Table  2  Reflection  coefficient  as  a  function,  of  scan  angle  for  the  microstrip  patch  array  1 
after  introducing  the  coupling  network  for  matching  upto  75  deg.  with  4 
parameters :  la=0. 159  X0,  be  normalized  =  0. 1 62, 1^=0.407  X0 ,  bt  normalized  = 
-0.340) 
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Scan 

E-Plane 

D-Plane 

H-Plane 

Angle 

Reflection  Coefficient 

Reflection  Coefficient 

Reflection  Coefficient 

(deg.) 

magnitude 

phase 

magnitude 

phase 

magnitude 

phase 

(deg.) 

(deg.) 

(deg.) 

0.0 

0.0 

0.0 

5.0 

-31.38 

45.13 

0.008 

91.00 

10.0 

-35.69 

0.012 

48.66 

0.029 

92.28 

15.0 

0.043 

-38.08 

0.027 

50.86 

0.062 

94.10 

20.0 

0.070 

-42.08 

0.046 

53.03 

0.100 

96.29 

25.0 

0.098 

-46.50 

0.067 

55.33 

0.138 

98.32 

30.0 

0.123 

-51.20 

0.089 

58.35 

0.170 

100.02 

35.0 

0.142 

-56.54 

0.109 

61.36 

0.191 

100.95 

40.0 

0.152 

-62.85 

0.126 

64.80 

0.199 

100.77 

45.0 

0.151 

-70.64 

0.136 

68.44 

0.188 

98.86 

50.0 

0.138 

-81.05 

0.139 

72.91 

0.159 

94.46 

55.0 

0.112 

-97.00 

0.132 

78.97 

0.110 

83.75 

60.0 

0.081 

-129.59 

0.114 

89.40 

0.052 

40.81 

65.0 

0.087 

167.99 

0.088 

112.46 

0.095 

-49.29 

70.0 

0.172 

126.65 

0.088 

162.13 

0.225 

-72.75 

Table  3  Reflection  coefficient  as  a  function  of  scan  angle  for  the  microstrip  patch  array  1 
after  introducing  the  coupling  network  for  matching  upto  70  deg.  with  4 
parameters:  la=0. 1 80  X0 ,  be  normalized  =  0.135, 1^=0.387  bh  normalized  = 
-0.290) 
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Scan  E-Plane  D-Plane  H-Plane 

Angle  Reflection  Coefficient  Reflection  Coefficient  Reflection  Coefficient 

(deg.) 

magnitude  phase  magnitude  phase  magnitude  phase 

(deg.)  (deg.)  (deg.) 

0.0 

0.0 

0.0 

0.0 

0.0 

0.0 

0.003 

-117.81 

0.006 

-114.35 

-107.21 

10.0 

0.013 

-123.57 

0.025 

-113.72 

-110.84 

15.0 

0.027 

-125.70 

0.054 

-115.40 

-114.01 

20.0 

0.043 

-129.93 

0.090 

-118.17 

0.125 

-117.99 

25.0 

0.058 

-134.49 

0.128 

-121.60 

0.172 

-123.18 

30.0 

0.069 

-139.27 

0.165 

-124.79 

0.211 

-128.43 

35.0 

0.073 

-145.03 

0.197 

-128.16 

0.239 

-134.08 

40.0 

0.068 

-152.75 

0.222 

-131.08 

0.251 

-140.30 

45.0 

0.052 

-165.20 

0.236 

-133.70 

0.248 

-147.92 

50.0 

0.028 

161.12 

0.239 

-135.59 

0.230 

-157.84 

55.0 

0.035 

64.31 

0.228 

-136.65 

0.200 

-173.10 

60.0 

0.092 

31.50 

0.202 

-135.87 

0.173 

161.34 

65.0 

0.175 

16.65 

0.157 

-131.58 

0.183 

124.54 

70.0 

0.284 

4.75 

0.094 

-112.97 

0.260 

91.75 

Table  4  Reflection  coefficient  as  a  function  of  scan  angle  for  the  microstrip  patch  array  1 
after  introducing  the  coupling  network  for  matching  upto  70  deg.  with  3 
parameters:  la=0.221  X0 ,  be  normalized  =  0.1 14,  bh  normalized  =  0.280) 
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Scan  Angle 

E-Plane 

D-plane 

(deg.) 

Reflection  Coefficient 

Reflection  Coefficient 

magnitude 

phase 

magnitude 

phase 

.Ides,) . 

(deg.) 

0.0 

0.0 

0.0 

0.0 

0.0 

5.0 

0.012 

-115.38 

0.006 

-113.95 

10.0 

0.050 

-117.76 

0.026 

-115.03 

15.0 

0.118 

-122.03 

0.057 

-116.77 

20.0 

0.225 

-128.76 

0.098 

-119.07 

25.0 

0.381 

-138.76 

0.148 

-121.79 

30.0 

0.584 

-152.83 

0.203 

-124.75 

35.0 

0.797 

-170.58 

0.260 

-127.71 

Table  5  Reflection  coefficient  as  a  function  of  scan  angle  for  the  microstrip  patch  array  2 
(before  matching) 


Scan  Angle 
(deg.) 

E-Plane 

Reflection  Coefficient 
magnitude  phase 

(deg.) 

D-plane 

Reflection  Coefficient 
magnitude  phase 

(deg.) 

0.0 

0.0 

0.0 

0.0 

0.0 

5.0 

0.027 

-117.84 

0.014 

-116.68 

10.0 

0.101 

-124.58 

0.054 

-121.64 

15.0 

0.199 

-134.66 

0.113 

-125.82 

20.0 

0,288 

-148.20 

0.181 

-131.00 

25.0 

0.327 

-167.16 

0.250 

-136.85 

30.0 

0.264 

153.88 

0.309 

-142.50 

35.0 

0.355 

24.08 

0.356 

-147.66 

Table  6  Reflection  coefficient  as  a  function  of  scan  angle  for  the  microstrip  patch  array  2 
after  introducing  the  coupling  network  with  only  2  parameters:  1,=0.091  A,0,  be 
normalized  =  0.659) 
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VERTICAL-CAVITY  SURFACE-EMITTING  LASERS 
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Abstract 

Vertical-Cavity  Surface  Emitting  Lasers  (VCSELs)  for  "Smart  Pixels"  and  optoelectronic 
interconnects  have  been  studied  under  the  RDL  Summer  Faculty  Research  Program.  The 
objective  was  to  provide  reliable  and  robust  processing  recipes  through  iterative 
fabrication/characterization  cycles.  A  working  4x4  VCSEL  array  has  been  demonstrated. 
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1  Introduction 


Semiconductor  diode  lasers  are  key  components  in  a  vast  range  of  applications  such  as 
photonic  switching,  optical  communication  and  optical  interconnects.  A  revolutionary 
form  of  these  devices,  the  VCSEL,  is  currently  undergoing  extensive  study.  VCSELs  have 
the  advantages  of  high  packing  density  with  narrow  optical  beam1  and  low  threshold 
current.2  The  desirable  round  geometry  of  the  devices  produces  circular  radiation  patterns 
which  are  superior  for  fiber  coupling  over  lateral  lasers.  The  circular  beam  may  also  be 
better  suited  for  vertical  optical  interconnections.  The  circular  beam  is  much  more  compact 
than  the  LEDs  used  in  previous  vertical  optical  interconnect  studies.3  Higher  packing 
density  can  result  from  the  narrower  beam  diameter.  The  small  size  also  facilitates  2-D 
addressable  array  fabrication  for  parallel  communications.  VCSELs  are  also  more  easily 
manufactured  than  present  commercial  laser  diodes  since  they  are  amenable  to  wafer  scale 
fabrication  and  testing. 

To  this  point,  much  research  effort  has  been  concentrated  on  novel  designs  and  processing 
approaches  for  VCSELs,  since  their  power  conversion  efficiency  is  currently  much  lower 
than  that  of  conventional  lasers.  VCSEL  devices  integrated  with  novel  optoelectronics 
have  been  also  been  investigated,  but  the  lengthy  and  sophisticated  processing  procedures 
necessary  to  produce  efficient  VCSELs  has  so  far  limited  the  success  of  these  integration 
efforts  due  to  the  daunting  complexity  of  the  resulting  heterostructures.  The  aim  of  the 
research  project  summarized  here  was  to  continue  efforts  to  develop  reliable,  robust,  and 
relatively  simple  fabrication  recipes  for  individually  addressable  VCSEL  arrays  to  allow 
practical  integration  with  other  optoelectronics. 
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2  Overview  of  Work  Performed 

The  following  tasks  were  accomplished  during  my  term  under  the  1995  Summer  Faculty 
Research  Program: 

■an  MBE  growth  recipe  for  InGaAs-based  980-  nm  VCSEL  arrays  was  formulated, 
based  on  previous  UCSB-generated  designs. 

■A  VCSEL  mask  set  for  device  isolation,  contacting,  and  related  processing  was 
created. 

■A  VCSEL  wafer  supplied  by  PRI,  Inc.  was  processed  into  devices  and  device 
arrays  using  photolithography,  ECR  etching,  chemical  etching,  metallization,  and 
other  relevant  techniques. 

■Optical  and  electronic  properties  of  the  as-grown  devices  were  characterized 

■Code  for  locating  VCSEL  cavity  resonances  was  developed;  code  for  designing 
DBR  mirrors  was  obtained  from  other  sources;  and  extensive  work  on  a  model 
treating  gain,  threshold  behavior,  and  strain  effects  in  VCSELs  was  completed. 

In  the  sections  that  follow,  we  review  these  accomplishments.  Section  three  discusses  our 
theoretical  modeling  efforts;  section  four  briefly  summarizes  device  fabrication  work 
performed;  section  five  presents  device  testing  results;  and  section  six  contains  our 
conclusions. 
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Theoretical  Modeling 


Our  modeling  efforts  focused  on  predicting  gain  and  threshold  behavior  in  980  nm 
strained  InGaAs  VCSELs.  Since  we  had  earlier  encountered  surprising  hurdles  in 
obtaining  CRAY  cpu  time  needed  for  a  full  many-body  treatment  of  strained  InGaAs  gain 
dynamics,  it  was  decided  to  pursue  the  conceptually  simpler  Fermi-golden  rule  approach.4 
This  allowed  modeling  efforts  to  coalesce  around  the  UCSB-developed  treatment  of  Scott 
Corzine  et  al..5  This  approach  offered  the  advantage  of  providing  a  more  intuitively  clear 
connection  to  electrical  pumping  properties  like  threshold  current,  slope  efficiency,  etc. 
than  more  physics-oriented  many-body  techniques.  The  Corzine  approach  to  modeling 
key  VCSEL  behavior  is  diagrammed  schematically  in  Fig.  1. 

This  approach  has  four  basic  steps.  First  and  foremost,  the  subband  structure  for  the 
quantum-well  gain  region  must  be  determined.  This  is  the  most  involved  step  from  the 
standpoint  of  theoretical  complexity  and  computational  effort.  Once  this  has  been 
accurately  determined,  a  loop  is  entered  in  which  optical  gain  and  current  density  are 
calculated  as  a  function  of  carrier  density.  During  each  pass  through  the  loop,  three  steps 
are  accomplished  in  a  self-consistent  fashion:  the  quasi-Fermi  levels  are  determined  for 
a  given  N  and  P;  the  gain  corresponding  to  these  quasi-Fermi  levels  is  calculated,  and  the 
current  density  required  to  maintain  the  quasi-Fermi  level  separation  is  solved  for. 

Next  to  the  subband  structure  determination,  finding  the  spectral  gain  is  the  most 
complicated  step  in  the  Corzine  approach.  The  gain  is  defined  as  the  product  of  three 
terms:  the  band-to-band  interaction  strength,  the  squared  field  strength  of  one  photon,  and 
the  number  of  interacting  transition  pairs.  The  second  term  is  a  function  of  the 
waveguiding  and  confinement  characteristics.  The  first  and  third  terms,  which  primarily 
depend  on  evaluating  fermi  functions  and  quasi  fermi  levels,  rely  fundamentally  on 
knowledge  of  the  subband  structure  in  the  material.  This  is  also  the  point  at  which  the 
effects  of  strain  are  most  accurately  treated. 

Because  of  its  crucial  importance,  we  tackled  the  subband  structure  first  in  our  modeling 
efforts.  Extensive  effort  was  devoted  to  developing  a  Luettinger-Hamiltonian-based 
model  allowing  for  the  calculation  of  InGaAs  valence  and  conduction  subband  structure 
for  both  strained  and  unstrained  materials.  As  it  turned  out,  in  the  limited  time  frame 
available,  this  was  as  far  as  we  got.  In  fact,  we  were  only  successful  in  developing  code 
which  predicted  the  subband  structure  of  unstrained  materials.  This  code  has  yet  to  be 
incorporated  within  the  larger  framework  required  to  predict  overall  device  gain  and 
threshold  current  It  is  likely  that  another  3-6  months  of  effort  will  be  required  to  produce 
the  working  optimization  code  originally  envisioned. 
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Theoretical  Modeling  (cont.) 


Figure  1  Flow  chart  for  QW  gain  calculations  developed  by  S.  Corzine  et  al.. 


The  program  immediately  came  up  against  accuracy  problems  and  round-off  error  effects 
in  computing  the  matrix  determinants  needed  to  evaluate  the  various  subbands.  We  were 
thus  forced  to  import  "canned"  LAPACK  routines  to  perform  these  operations.  As  of  yet, 
stability  problems  in  the  combined  routines  have  not  been  completely  overcome.  The  code 
calculates  subband  structure,  but  the  accuracy  is  questionable  far  from  the  center  of  the  1st 
Brillouin  zone  (i.e.,  far  from  k=0).  Since  these  regions  are  crucial  for  strain -related  effects, 
extensive  additional  work  will  be  necessary  to  resolve  these  issues. 
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3  Theoretical  Modeling  (cont.) 

A  basic  model  of  VCSEL  facet  reflectivity  was  developed  for  use  in  DBR  mirror  stack 
design.  Additionally,  a  copy  of  the  Randy  Geels'  original  MultiLayer  code  written  for  the 
Macintosh  was  also  obtained  from  Dr.  Geels  for  comparison  with  the  DBR  code  we 
developed.  Unfortunately,  the  Geels  program  lacks  the  capability  to  treat  material 
absorption,  which  is  definitely  relevant  in  obtaining  accurate  mirror  designs. 

To  summarize  our  theoretical  efforts,  we  made  strong  progress  in  developing  a  flexible 
and  device-oriented  model  suitable  for  realistic  evaluation  and  optimization  of  gain  and 
threshold  behavior  in  980-nm  strained  InGaAs  VCSEL  devices.  However,  additional  effort 
will  be  required  during  the  next  phase  of  the  project  to  bring  the  model  to  completion. 
Several  models  of  DBR  mirror  performance  and  associated  cavity  resonances  were 
developed  and/or  acquired.  The  resulting  suite  of  theoretical  tools  should  put  us  in 
excellent  shape  to  tackle  design  and  characterization  of  the  next  generation  of  devices  in 
future  contracts. 


4  Fabrication 

The  VCSEL  wafer  used  in  this  work  was  supplied  by  PRI  of  Longmont,  Colorado.  The 
wafer  was  grown  by  molecular  beam  epitaxy  on  a  GaAs  buffer  layer.  The  bottom  mirror 
is  Si  doped  AlAs/AlGaAs  with  29  periods.  The  active  region  consists  of  0.19  pm 
Alo.25Gao.75As  confinement  layers  surrounding  a  gain  region  with  two  80  A  Aj,^  G^  As 
quantum  wells  (QW)  and  80  A  GaAs  barriers.  The  p-type  of  mirror  is  capped  with  a  250 
A  p+  GaAs  contact  layer. 

There  are  quite  a  few  different  fabrication  approaches  used  in  making  VCSELs.  One 
relatively  simple  scheme  requiring  comparatively  few  processing  steps  is  the  "etched 
pillar"  VCSEL  design,  similar  to  those  used  in  research  groups  at  UCSB  and  elsewhere.6 
Fabrication  typically  involves  several  different  steps  of  masking,  deposition,  and  etching 
as  illustrated  in  Figure  2.  Step  1  is  used  to  define  etch-post  and  etch  masks.  Once  the 
desired  thickness  of  photoresist  is  spun  on  and  developed,  the  wafer  is  ready  to  be  etched 
by  electron  cyclotron  resonance  (ECR)  in  step  2.  The  etched  mesas  basically  serve  to  define 
the  current  injection  path,  and  the  etching  depth  is  just  below  the  active  region.  Then 
polyimide  is  used  in  step  3  to  passivate  the  side  wall  surface  of  the  VCSEL  etched  posts, 
and  polyimide  is  baked  for  a  few  cycles.  Polyimide  which  covers  the  top  of  the  mesas  will 
be  removed  by  reactive  ion  etching  (RIE)  since  the  laser  emission  is  from  the  top.  In  step 
4,  a  second  mask  is  patterned  on  the  etched  posts  for  the  p-metal  deposition.  Metal 
deposition  uses  standard  lift-off  techniques.  Finally,  n-type  ohmic  contact  material  is 
evaporated.  The  detailed  process  we  used  to  fabricate  VCSELs  is  given  in  Appendix  I. 
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Fabrication  (cont.) 


Photoresist  ECR  polyimide  P-metal 

etch  masks  etch  passivation  deposition 


J 

bL 

.  A 

-M 

- 

Step  1  Step  2  Step  3  Step  4 


Figure  2  Schematic  of  etched-pillar  VCSEL  processing  flow 

The  process  described  above  utilizes  dry  etching;  wet  etching  has  been  also  carried  out  for 
comparison.  For  wet  etching,  step  1  is  the  same  as  described  in  Figure  2.  In  step  2,  a 
mixed  solution  of  3:1;50  of  HsPO^F^O^HjO  has  been  used  for  etching.  However,  the 
etching  rate  is  difficult  to  control  and  the  resulting  mesa  surfaces  are  very  rough. 

In  our  work,  VCSELs  were  fabricated  using  the  processes  described  above  into  both  3x4 
and  4x4  arrays.  Each  VCSEL  is  designed  to  be  of  a  different  diameter  ranging  between  5 
and  30  pm  so  that  the  VCSELs'  performance  can  be  compared.  The  pitch  (laser  to  laser 
spacing)  is  also  varied  from  10  to  40  pm  for  comparison.  A  scanning  electron  microscope 
(SEM)  micrograph  of  the  completed  arrays  is  shown  in  Figure  3. 


Figure  3  SEM  image  of  completed  VCSEL  array 


5  Device  Testing 


The  testing  of  the  VCSELs  arrays  was  carried  out  at  the  Rome  Laboratory  Photonics 
Center.7  A  schematic  of  the  probe  station  set-up  used  to  perform  I-V  measurements  is 
shown  in  Figure  4.  A  Cambridge  Instruments  MicroZoom  II  is  the  central  part  of  the  probe 
station,  and  a  CCD  camera  and  monitor  are  connected  to  the  microscope.  The  microscope 
is  mounted  on  a  large  X-Y  translation  stage  and  a  laboratory  jack.  The  devices  to  be  tested 
are  electrically  mounted  on  copper  blocks  using  silver  epoxy  to  maintain  good  common 
ground  and  heat  sinking  properties.  The  devices  have  X-Y  translation,  two-axis  tilt,  and 
rotation  for  proper  positioning.  Electrical  probing  is  accomplished  using  Alessi  high- 
frequency  and  DC  probes  mounted  on  X-Y-Z  translation  stages. 


Figure  4  Probe  Station  set-up 


5  Device  Testing  (cont.) 

The  purpose  of  the  electrical  probing  is  to  supply  the  necessary  bias  and  drive  signals  to 
the  devices  and  to  pick  off  the  electrical  signals  at  the  desired  points.  The  tip  of  the  probe 
is  of  great  concern  when  contacting  the  devices.  The  Alessi  tungsten  probe  tips  usually 
make  the  better  electrical  contact  with  the  devices,  provide  a  more  stable  electrical 
connection  and  have  a  lower  contact  resistance.  An  electrical  probe  contact  indicator  is 
made  to  ensure  the  physical  contact  between  the  tip  and  the  device. 

Electronically,  the  device  is  driven  by  the  HP  8116A  Pulse/Function  Generator  (PFG).  The 
PFG  supplies  either  triangular  or  sine  pulses  of  the  desired  frequency  and  amplitude.  The 
HP  5411  ID  Digital  Storage  Oscilloscope  (DSO)  measures  the  I-V  and  L-I  curves  of  the 
devices.  A  typical  example  of  I-V  characteristics  of  individual  VCSELs  tested  is  shown  in 
Figure  5. 


Figure  5  Typical  I-V  characteristic  of  individual  VCSEL  device. 
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6  Conclusions 


We  have  demontrated  the  production  of  individually-addressable  AlGaAs/GaAs  VCSEL 
arrays.  The  fabrication  procedure  has  been  simplified  and  contains  only  two  levels  of 
masks.  The  quality  of  the  side  walls  of  the  VCSELs  etched  by  ECR  is  good.  As  a 
comparison,  wet  etching  produces  less  desirable  device  quality.  The  reduced  complexity 
of  the  VCSEL  array  fabrication  scheme  bodes  well  for  the  use  of  VCSELs  as  "Smart  Pixels" 
in  novel  optoelectronic  interconnect  applications. 
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Appendix  I  Processing  Log 

1  Wafer  clean 

aceton,  methanol,  isopropanol  and  DI  water 
ammonia  hydroproxide 
vapour  prime  30  min. 

2  Zero  level  mask 

170C  hot  plate  10-15  minutes  prior  to  photoresist  (PR) 
Shipley  primer  3000rpm  lOsec. 

CS50  6000rpm  30sec. 
pre-bake  90C  2  min. 

expose  on  10:1  stepper  1.3sec.  for  positve  tone 
post-bake  90C  lmin. 

OCG  945  40-60sec. 

3  Mesa  definition 

glue  chips  to  3"  Si  wafer  by  FSC-M,  bake  1  hour  @  90C 
ECR  setting:  10  Torr  He  @  0C  chamber  temperature,  16  seem 
C12, 4  seem  BCL3, 2  mT  chamber  pressure,  16  A  upper,  20  A 
lower  magnet  current,  400  W  microwave,  80  W  RF 
pre-run  of  ECR  300-500sec. 

1165  30min.  to  strip  PR 

4  Polyimide  (PI)  passivation 

Q3289:QZ3290=1:9  for  PI  adhesion 
spin  20sec.  @  5000rpm 
287  PI  20sec.  @  5000rpm 

post-bake  90C  30min.,  120C  30min.,  150C  15min.,  180C 
15min.,  240C  30min. 

strip  of  PI  at  the  top  of  mesas  in  RIE  8min. 

5  p-type  metallization 

Shipley  primer  3000rpm  lOsec. 

CS50  6000rpm  30sec. 
pre-bake  90C  2  min. 

expose  on  10:1  stepper  0.6sec.  for  image  reversal 
Yes  oven  85min. 

HTG  flood  60  sec. 

OCG  945  4min. 
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Appendix  I  Processing  Log  (cont.) 

6  p-type  metal  evaporation 

low  power  02  descum  lmin. 
ammonia  hydroproxide  dip  lOsec 
CHA  evaporator 

400A  Ti,  200A  Pt,  3000A  Au,  3000A  Cr,  1500A  Ni 
soak  in  1165  >45min  for  lift  off 


7  n-type  metallization 

lap  2pm  off  from  substrate 

CHA  evaporator  100A  Ni,  400A  Ge,  800A  Au,  500A  Ag,  700A  Au 
300  RTA 
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IMAGE  EXPLOITATION:  WAVELETS  RESEARCH  AND 
IMAGERY  TOOLKIT  DEVELOPMENT 
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Abstract 


The  exploitation  of  digital  reconnaissance  imagery  requires  efficient  storage,  transmission, 
image  processing,  browsing,  retrieval  and  analysis  of  large  volumes  of  image  data.  An  image 
compression  scheme  for  multiresolution  coding  by  wavelets  to  alleviate  the  transmission  prob¬ 
lem  of  high  resolution  images  over  low  speed  channels  is  developed.  This  scheme  has  the 
advantage  of  progressive  transmission  such  that  it  can  transmit  the  lowest  resolution  image  to  the 
decoder  and  the  finer  details  are  transmitted  progressively.  A  block  variance  detector  on  the 
lowest  subband  to  predict  if  the  high  subbands  contain  edge  information  is  used  to  achieve  effi¬ 
cient  compression.  The  morphological  close-minus-open  operation  is  used  for  target  recognition 
to  eliminate  background  clutter  whose  size  is  larger  than  the  targets.  A  macro  wavelet  function 
with  a  Gaussian  low-pass  filter  is  used  to  clutter  with  particle  sizes  similar  to  or  smaller  than  the 
targets.  The  graphical  user  interface  of  an  imagery  toolkit  is  developed  under  X  window  in  SUN 
Solaris  system.  It  includes  the  often-used  techniques  in  image  processing  such  as  image  manipu¬ 
lation,  enhancement,  feature  extraction,  filtering,  degradation,  statistics,  segmentation,  and  mor¬ 
phology. 
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IMAGE  EXPLOITATION:  WAVELETS  RESEARCH  AND 
IMAGERY  TOOLKIT  DEVELOPMENT 

Frank  Y.  Shih 


1.  Introduction 

The  introduction  of  wavelets  in  signal  and  image  analysis  was  a  major  breakthrough 
because  of  their  ability  to  describe  a  signal  or  image  in  time  and  in  frequency  simultaneously, 
thus  overcoming  classical  limitations  of  Fourier  analysis.  If  tracing  back,  some  of  the  pioneering 
work  of  wavelet  transform  was  introduced  by  Haar  (1910)  [6],  Gabor  (1946)  [5],  and  Morlet 
(1982)  [10].  As  it  came  into  the  attention  of  other  scientists  it  was  recognized  to  be  useful  for 
other  signal  analysis  applications.  Nowadays,  there  exist  more  than  ten  widely-used  wavelet 
transform  algorithms  (see  [4],  [6],  [9],  [14]). 

Like  sines  and  cosines  in  Fourier  analysis,  wavelets  are  used  as  basis  functions  in  represent¬ 
ing  other  functions  by  decomposition.  These  wavelet  functions  are  generated  by  dilations  and 
translations  of  a  single  basis  function  (called  the  mother  wavelet).  The  Haar  wavelet  has  been 
used  in  various  mathematical  fields.  It  is  well  known  that  any  continuous  function  can  be 
approximated  uniformly  by  Haar  functions.  Dilations  and  translations  of  the  mother  wavelet  \j/, 

\\fjk(x )  =  2jl\(2Jx  -k),  (1 ) 

where  j  and  k  are  integers,  define  an  orthogonal  basis  in  L2(R)  (the  space  of  all  square  integrable 
functions).  The  simplest  of  all  wavelets  is  the  Haar  wavelet  It  is  a  step  function  taking 
values  1  and  -1  on  [0,  1/2)  and  [1/2,  1],  respectively. 

The  2-D  wavelet  transform  of  an  input  image  I  (x,  y)  is  expressed  as 

W ( ax ,  ay,  bx,  by)  = . 7=rff I(x,  y)  \j/( - -,  - - -)  dx  dy,  (2) 

iaxay  ax  ay 

where  ( ax ,  ay)  are  dilation  parameters  in  x  and  y  directions,  and  ( bx ,  by)  are  translation  parame¬ 
ters  that  apply  the  wavelet  filter  function  \|/  to  different  local  regions  of  the  scene. 

Wavelet  transform  methods  for  image  data  produce  a  multiresolution  representation  in 
which  the  spatial  structure  of  the  image  is  preserved  within  every  level.  The  quadrature  mirror 
filters  were  developed  for  1-D  and  2-D  wavelet  decomposition  and  reconstruction  in  Mallat’s 
algorithms  [8].  The  filtering  analyzes  each  dimension  of  the  input  data  into  two  subbands:  the 
first  consists  of  the  average  information  of  the  signal  (low  frequency  components)  and  the 
second  contains  the  detailed  information  (high  frequency  components).  The  low  frequency  infor¬ 
mation  can  be  considered  as  the  approximation  of  the  image  and  the  high  frequency  information 
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as  the  details  lost  at  this  step  of  the  approximation  process. 

Let  <p(x)  is  a  1-D  low-pass  filter  and  \{/(x)  is  the  mother  wavelet  function  (a  high-pass  filter) 
associated  with  the  scaling  function.  In  two  dimensions,  the  2-D  wavelet  transform  can  be 
equivalently  computed  with  a  separable  extension  of  the  1-D  decomposition  algorithm  [8],  For  a 
matrix  of  2-D  image  data  I  (x,  y )  of  dimension  2n  x  2”,  where  n  is  an  integer,  we  first  convolve 
the  rows  of  /  (x,  y)  with  the  1-D  wavelet  filter  and  two  matrices  of  dimension  2"  x  2n~l  are 
obtained  as  shown  in  Fig.  1,  where  (})(x)  and  \j r(x)  are  a  low-pass  filter  and  a  high-pass  filter, 
respectively. 


Fig.  1.  The  2-D  image  wavelet  transform  by  a  separable  1-D  decomposition  algorithm. 

We  then  convolve  the  columns  of  the  resulting  two  matrices  with  the  1-D  wavelet  filter  and 
the  four  resulting  matrices  <Kx)<j)(y),  4>(*)y(y ),  \\r(x)ty(y),  and  y(x)\j/(y)  of  dimension  2n~l  x  2n_1 
are  obtained.  The  matrix  <f)(x)0(y)  is  a  half-resolution  down-sampled  image  of  low  frequencies 
(referred  to  as  the  thumbnail  image),  the  image  q(x)\j/(y)  gives  the  vertical  high  frequencies 
(horizontal  edges)  and  \j/(*)(j)(y)  the  horizontal  high  frequencies  (vertical  edges)  and  \|/(x)\|/(y) 
the  high  frequencies  in  both  directions  (the  comers  as  well  as  diagonal  edges). 

The  filtering  is  repeated  successively  on  the  first,  low-frequency  subband,  producing  a 
pyramid  of  subbands.  Part  of  the  rationale  for  focusing  on  the  low-frequency  subband  is  that 


most  of  the  energy  of  the  image  is  contained  in  that  portion  of  the  data.  The  output  image  of  the 
wavelet  representation  is  arranged  as  shown  in  Fig.  2,  where  the  top-left  quadrant  image  is  the 
output  for  the  wavelet  filter  <p(jc)4>(y )  (the  low-frequency  components),  the  top-right  for  <f>(x)\|/(y) 
(the  horizontal  edges),  the  bottom-left  for  \}/(;t)(|)(y)  (the  vertical  edges),  and  the  bottom-right  for 
\j/(x)\|/(y)  (the  diagonal  edges).  Note  that  the  subscript  index  indicates  the  level  of  resolution. 
The  wavelet  transform  procedure  can  be  recursively  repeated  on  the  thumbnail  image  until  a  2  x 
2  image  is  occurred  or  a  predefined  criterion  is  satisfied.  The  criterion  for  continuing  the  decom¬ 
position  can  be  based  on  the  minimization  of  a  cost  function,  such  as  total  energy  of  the  image 
representation  [3].  Fig.  3  shows  the  original  Lenna  image  and  its  three  level  decomposition  of 
the  wavelet  representation. 
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Fig.  2.  Three  level  resolutions  in  a  2-D  wavelet  transform. 


2.  Multiresolution  Decomposition  and  Progressive  Transmission 

Our  approach  for  multiresolution  decomposition  is  described  as  follows.  At  the  encoder,  the 
input  image  is  decomposed  into  several  frequency  subbands.  The  lowest  subband  is  encoded  by 
Differential  Pulse  Code  Modulation  (DPCM)  [16]  or  some  other  loseless  coding  algorithms.  The 
other  high  subbands  are  partitioned  into  blocks  to  be  vector  quantized.  Not  all  blocks  in  high 
subbands  are  needed  to  be  vector  quantized.  As  we  can  expect,  the  probability  density  function 
(PDF)  of  the  high  subbands  are  closely  approximated  by  a  generalized  Gaussian  distribution 
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(b) 

Fig.  3  (a)  Original  Lanna  image,  (b)  Wavelet  representation  on  three  level  resolutions. 
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with  p  =  0.7  [1],  That  is,  there  are  few  nonzero  values  which  contain  the  information  of  edge  in 
high  subbands.  In  Fig.  3b,  most  of  nonzero  values  almost  concentrate  on  the  edge  of  original 
image.  From  [1],  the  variance  cf  the  real  PDF  of  high  subbands  and  PDF  after  discarding  the 
smooth  corresponding  blocks  in  high  subbands  are  almost  equal.  Here  we  refer  the  correspond¬ 
ing  blocks  in  high  subbands  are  blocks  in  high  subband  whose  positions  are  corresponding  to  the 
block  in  the  lowest  subband. 


Transmission  of  high  resolution  images  over  low  speed  channels  always  presents  a  prob¬ 
lem,  owing  to  the  lengthy  transmission  time.  For  instance,  transmitting  a  picture  of  512  x  512 
pixels  at  8  bits/pixel  over  a  9600-baud  line  can  take  4  minutes.  An  approach  to  alleviate  the 
problem  is  the  multiresolution  coding  scheme  which  decomposes  the  signal  into  the  lowest  reso¬ 
lution  and  the  difference  between  the  approximation  of  original  signals  at  each  successive  reso¬ 
lution.  This  scheme  has  the  advantage  of  progressive  transmission  [12].  That  is,  it  can  transmit 
the  lowest  resolution  signal  to  the  decoder,  and  the  finer  details  are  transmitted  progressively. 
This  property  is  a  very  useful  technique  for  image  database  query.  We  can  determine  that  the 
image  is  needed  or  not  at  the  first  glance  into  the  lowest  resolution  image.  It  can  save  the 
transmission  time  and  increase  the  bandwidth  of  a  network. 


We  apply  a  block  variance  detector  on  the  lowest  subband  to  predict  the  corresponding 
blocks  in  high  subbands  whether  they  are  high  variation  or  not.  If  the  block  in  the  lowest  sub¬ 
band  is  determined  to  be  smooth,  then  we  think  the  corresponding  blocks  in  high  subbands  con¬ 
tain  all  zero  values.  Therefore,  the  corresponding  blocks  in  high  subband  are  discarded.  Other¬ 
wise,  the  corresponding  blocks  are  vector  quantized  and  transmitted  to  decoder. 


When  the  decoder  receives  the  lowest  subband  completely,  it  applies  the  same  block  vari¬ 
ance  detector  as  the  encoder’s  on  the  lowest  subband.  If  the  block  in  the  lowest  subband  is  deter¬ 
mined  to  be  high  variation,  then  the  incoming  index  to  look  up  codebooks  is  obtained  for  patch¬ 
ing  the  codevectors  on  the  corresponding  blocks  in  high  subbands.  Otherwise,  the  decoder  pads 
zero  values  on  the  corresponding  blocks  in  high  subbands. 


The  variance  detector  of  a  block  to  be  smooth  or  high  variation  is  defined  as 


a  = 


y  y  (x(m,  n)—x (m,  n))2 
M  xN 


1/2 


(3) 


where  the  M  x  N  is  the  blocksize,  the  x  is  the  mean  of  this  block.  If  a  is  greater  than  the  prede¬ 
fined  threshold,  then  the  block  is  determined  to  be  high  variation.  Otherwise,  it  is  considered  as 
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smooth.  The  scheme  can  effectively  reduce  bitrate  without  degrading  image  quality  visually. 


3.  Scanning  Pattern  Techniques 

A  scanning  pattern  approach  refers  to  any  technique  that  transmits  transform  coefficients  in 
the  order  following  a  certain  scanning  pattern  through  the  2-D  transform  coefficients.  Initially,  a 
small  number  of  coefficients,  determined  by  the  scanning  pattern,  from  each  block  are  transmit¬ 
ted  to  the  receiver  to  generate  the  first  approximation  of  the  image.  In  each  pass  of  subsequent 
transmission,  a  group  of  coefficients,  also  determined  by  the  scanning  pattern,  from  each  block 
are  transmitted  to  the  receiver  to  refine  an  existing  reconstruction. 

Differences  among  variations  of  this  approach  are  generally  in  the  scanning  patterns.  The 
optimal  scanning  pattern  should  be  the  one  that  sends  coefficients  in  the  descending  order  of 
their  variance  values  since  this  will  minimize  the  mean-squared  distortion  due  to  untransmitted 
transform  coefficients.  The  zigzag  pattern  is  an  often-used  scanning  pattern  in  threshold 
transform  coding  [2].  This  pattern  usually  generates  more  consecutive  zeros  for  coefficients 
under  the  threshold,  and  efficient  run-length  codes  can  be  employed.  An  example  of  the  zigzag 
pattern  for  8  x  8  discrete  cosine  transform  (DCT)  coefficients  is  shown  in  Fig.  4a. 

We  observe  that  if  the  ordering  of  the  transform  coefficients  are  carried  out  exactly  in  a  zig¬ 
zag  manner,  the  resulting  bit-stream  does  not  have  a  property  which  can  provide  a  low  complex¬ 
ity  multiresolution  previewing.  We  modify  the  zigzag  scanning  strategy  as  shown  in  Fig.  4b 
such  that  the  ordering  of  the  transform  coefficients  follow  the  priority  order  of  the  rectangular 
zones.  That  is,  the  order  of  the  sequence  is  almost  zigzag  within  each  subblock  of  size  1  x  1,  2  x 
2,  4  x  4,  etc. 

4.  Target  Recognition 

Morphological  processing  and  wavelet  transforms  are  combined  together  to  detect  multiple 
object  in  a  high  clutter  scene  where  multiple  classes  of  objects  with  distortions  and  contrast  vari¬ 
ations  are  present.  Mathematical  morphology  [7],  [11]  obtains  its  name  from  the  studies  of  shape 
analysis.  Use  of  this  theoretical  basis  for  image  processing  by  many  algorithmists  has  shown  that 
it  is  natural  and  easy  to  think  in  terms  of  shapes  when  one  is  designing  computer  vision  algo¬ 
rithms.  Mathematical  morphology  can  extract  image  shape  features  such  as  edges,  fillets,  holes, 
comers,  wedges,  and  cracks  by  operating  with  various  shape-structuring  elements. 

4.1.  Morphological  Operations  —  The  morphological  operators  deal  with  two  images.  The 
image  being  processed  is  referred  to  as  the  active  image,  and  the  other  image  being  a  kernel  is 
referred  to  as  the  structuring  element.  Mathematical  morphology  represents  image  objects  as 
sets  in  a  Euclidean  space.  The  basic  morphological  operations  are  dilation,  erosion,  opening,  and 


(b) 


Fig.  4.  (a)  Zigzag  scanning  pattern  for  the  block  size  of  8  x  8,  (b)  A  modified  scanning  pattern. 
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closing. 

Dilation  combines  two  sets  using  vector  addition  of  set  elements.  Dilate  is  the  locus  of  all 
centers  such  that  the  structuring  element  translated  by  placing  the  origin  at  the  centers  of  B  at  c) 
hits  the  image  set.  Dilation  by  disk  structuring  elements  corresponds  to  the  isotropic  expansion 
algorithm.  Erosion  is  the  morphological  dual  to  dilation.  It  combines  two  sets  using  vector  sub¬ 
traction  of  set  elements.  Erosion  is  the  locus  of  all  centers  such  that  the  structuring  element 
translated  by  placing  the  origin  at  the  centers  is  entirely  contained  within  the  set. 

In  practical  applications,  dilation  and  erosion  pairs  are  used  in  sequence,  either  dilation  of 
an  image  followed  by  the  erosion  of  the  dilated  result  (called  closing),  or  erosion  followed  by 
dilation  (called  opening ).  In  either  case,  the  result  of  iteratively  applied  dilations  and  erosions  is 
an  elimination  of  specific  image  detail  smaller  than  the  structuring  element  without  the  global 
geometric  distortion  of  unsuppressed  features. 

4.2.  Morphological  Close  Minus  Open  Operation  —  The  procedures  we  use  in  the  morpholog¬ 
ical  recognition  algorithm  [17]  are:  perform  an  opening  and  a  closing  of  the  input  image  with  a 
horizontal  structuring  element  and  subtract  the  opened  output  from  the  closed  output,  repeat  this 
with  a  vertical  structuring  element  to  produce  another  output,  and  combine  the  two  outputs  using 
a  grayscale  intersection  (pointwise  minimum).  The  structuring  elements  used  are  chosen  to  be 
larger  than  the  height  and  width  of  the  largest  object. 

Fig.  5a  shows  an  image  of  cars  in  a  toll  highway.  Fig.  5b  shows  the  result  of  a  closing  with 
the  vertical  structuring  element  of  height  13  (i.e.  1  x  13).  All  dark  cars  whose  height  is  smaller 
than  13  are  removed  and  replaced  by  the  background  gray  level.  Fig.  5c  shows  the  vertical  open¬ 
ing  of  Fig.  5a  with  the  same  structuring  element;  this  removes  the  bright  cars.  Fig.  5d  shows  the 
subtraction  of  Fig.  5c  from  Fig.  5b;  all  cars  are  bright  and  those  dark  cars  and  shadows  of  bright 
cars  with  distances  less  than  13  have  been  merged.  Figs.  5e  and  5f  show  the  closing  and  opening 
respectively  with  the  same  horizontal  structuring  element  of  width  17  (i.e.  17  x  1).  Fig.  5g  shows 
the  subtraction  of  Fig.  5f  from  Fig.  5e.  The  pointwise  minimum  (Fig.  5h)  of  Figs.  5d  and  5g 
shows  that  the  merged  cars  have  been  separated  and  all  the  cars  are  bright. 

4.3.  Wavelet  Transform  Combined  with  Morphology  for  Clutter  Reduction  -  The  morpho¬ 
logical  close  minus  open  operation  eliminates  background  clutter  whose  size  is  larger  than  the 
size  of  the  target  objects.  However,  all  clutter  with  particle  sizes  similar  to  or  smaller  than  the 
largest  object  size  is  still  present.  We  use  wavelet  filters  to  detect  the  existence  of  the  high 
clutter  regions  in  an  input  scene,  and  then  eliminate  these  regions  in  target  recognition. 

We  view  a  high  clutter  region  as  consisting  of  many  clutter  particles  with  different  sizes, 
shapes  and  with  different  degree  of  edge  smoothness.  Wavelet  transforms  are  well  known  to  be 
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Fig.  5.  (a)  Input  image  of  cars  in  a  toll  highway,  (b)  Closing  of  (a)  with  1x13,  (c)  Opening  of 

(a)  with  1  x  13,  (d)  Difference  of  (b)  and  (c),  (e)  Closing  of  (a)  with  17  x  1,  (f)  Opening 
of  (a)  with  17  x  1,  (g)  Difference  of  (e)  and  (f),  (h)  Minimum  of  (d)  and  (g). 
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able  to  detect  edges  of  varying  smoothness  [9].  We  use  §a(x)  to  denote  a~1/2$(x/a)  and  use 
similar  forms  for  \\fa (x)  and  \|/a (>’ ) .  For  each  scale  a,  there  are  three  output  images  as 

shown  in  Fig.  2.  In  Fig.  3b,  we  show  the  wavelets  for  three  different  scales 
(ax  =  ay  =  a  =  2,  4,  8).  Note  that  the  topest  scale  is  2.  From  observation,  the  high  clutter  regions 
exist  dominantly  in  scales  a  =  4  and  a  —  8.  We  combine  both  of  the  wavelet  functions  into  a  sin¬ 
gle  macro  wavelet  function  w(x,  y)  =  +  c 2^8 (^)Ws Cy)-  Then  we  smooth  the  out¬ 

put  with  a  Gaussian  low-pass  filter. 

5.  Enhanced  Graphical  User  Interface  for  Imagery  Toolkit  and  Its  Extensions 

The  Imagery  Exploitation  2000  (IE2000)  imagery  toolkit  was  initially  developed  at  Rome 
Laboratory  under  the  1993  AFOSR  summer  faculty  research  program  by  professors  Robert 
Stevenson  [15]  and  Robert  Snapp  [13].  Then  Audrey  Copperwheat  added  graphical  user  inter¬ 
face  for  some  functions.  In  this  project,  we  have  extensively  implemented  the  graphical  user 
interface  for  all  the  functions  of  imagery  toolkit.  Furthermore,  some  useful  image  processing 
rountines  such  as  morphological  operations  and  segmentation  are  added. 

The  IE2000  Imagery  Toolkit  is  a  general  purpose  government-off-the-shelf  imagery  exploi¬ 
tation  software  package  intended  for  distribution  to  Department  of  Defense  (DOD)  application 
developers  and  users.  It  was  developed  on  SunOs  4  and  we  upgrade  to  SunOs  5  or  Solaris.  It 
was  written  in  ANSI  C  to  make  portable  and  was  developed  using  Motif.  It  is  intended  to  be  a 
comprehensive  tool,  covering  all  the  fields  in  image  processing.  An  effort  to  include  most  of 
techniques  has  been  attempted,  but  due  to  limited  time  and  its  large  number  some  remain  to  be 
implemented.  Later  versions  will  include  missing  techniques  as  well  as  new  ones.  The  toolkit 
has  aimed  to  make  it  very  easy  to  upgrade  and  modify.  Users  can  quickly  learn  about  different 
techniques  by  inspecting  the  codes  and  can  enhance  the  system  by  either  implementing  new 
techniques  not  provided  or  improving  the  ones  provided. 

5i_l.  The  Main  Window  —  The  principal  interface  to  the  program  is  the  main  window.  It  consists 
of  three  areas:  the  menubar,  the  editing  icons  and  the  display  area.  Fig.  6  shows  the  main  win¬ 
dow. 

5.7.7.  The  menubar  —  There  are  1 1  pulldown  menus  located  in  the  menubar.  The  first  one  "File" 
is  used  to  call  read  and  write  dialogs,  and  to  clear,  restore  and  exit.  The  following  8  menus  carry 
out  often-used  image  processing  routines.  The  menu  "Interface"  intends  to  interact  with  other 
application  programs.  The  "Help"  menu  provides  on-line  information  about  the  application. 

5.J_,2.  The  editing,  icons  —  The  editing  icons  include  clockwise  or  counterclockwise  rotation, 
image  cutting,  zooming,  and  audio  interface. 
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Fig.  6.  The  main  window. 

5-J-J~  The  display  area  —  This  area  is  reserved  for  image. display.  If  the  image  size  is  larger  than 
the  display  area,  horizontal  and  vertical  scroll  bars  are  automatically  shown. 

5.2.  The  Read  and  Write  Dialogs  -  These  dialogs  are  used  to  select  a  file  to  read  or  write  an 
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image  from  or  to  an  imagery  server  or  disk.  Both  use  the  file  selection  box  widget  which  encap¬ 
sulates  the  task  of  opening  a  directory  file,  reading  its  entries  and  traversing  the  directory  tree. 
The  contents  are  displayed  in  two  list  widgets:  one  holds  the  directory  files  within  a  given  direc¬ 
tory  and  the  other  holds  ordinary,  link  and  device  files.  Selecting  a  file  and  traversing  the  direc¬ 
tory  tree  are  done  by  clicking  an  item  in  the  appropriate  list  widget.  The  format  supported 
currently  is  TIFF.  Fig.  7  shows  the  read  dialog. 


Fig.  7.  The  read  dialog. 


5.3.  The  Image  Processing  Routines  —  The  image  processing  routines  are  grouped  into  8 
categories:  manipulation,  enhancement,  feature  extraction,  filtering,  degradation,  statistics,  seg¬ 
mentation,  and  morphology. 

5.3.1.  Manipulation  -  The  image  manipulation  contains  zoom,  rotation,  crop,  flip  on  the  hor¬ 
izontal  or  vertical  axis,  show  an  overview  window,  set  parameters,  and  add  or  multiply  a  con¬ 
stant.  Fig.  8  shows  the  munipulation  pulldown  menus.  The  "Zoom"  menu  includes  3  pulldown 
menus:  zoom  in,  zoom  out,  and  original  image  restoration.  The  zoom  factor  can  be  set  in  the 
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menu  of  "Set  Parameters."  The  "Rotate"  menu  includes  3  pulldown  menus:  rotate  right  (clock¬ 
wise),  rotate  left  (counter-clockwise),  and  restore  to  0  rotation  (the  original  image).  The  rotation 
angle  (degree)  can  also  be  set  in  the  menu  of  "Set  Parameters."  The  "Show  Overview  Window" 
allows  the  creation  of  another  window  showing  the  image  processed.  This  makes  it  possible  to 
compare  serveral  images  processed  by  different  techniques.  The  "Add  Constant"  menu  calls  a 
pulldown  menu:  constant  to  add.  It  is  possible  to  increment  each  pixel  with  a  given  amount  to 
alter  the  brightness.  Similarly,  each  pixel  can  be  multiplied  by  a  given  ratio  to  alter  the  contrast 
by  using  the  "Multiply  Constant"  menu. 
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Fig.  8.  The  manipulation  pulldown  menus. 

5.32.  Enhancement  -  The  image  enhancement  contains  adjust  mean,  adjust  contrast,  adjust 
Gamma,  map  intensities,  invert  intensities,  stretch  range,  mask  image,  set  parameters,  and  histo¬ 
gram  equalization.  Fig.  9  shows  the  enhancement  pulldown  menus.  The  contrast,  Gamma,  and 
intensity  offset  can  be  selected  by  using  the  "Set  Parameters"  menu.  The  "Mask  Image"  is  to 
enhance  an  image  using  an  unsharp  mask  operator  whose  window  width  and  height  and  the 
alpha  value  used  as  the  fraction  of  the  highpassed  added  to  the  original,  can  also  be  selected  by 
using  the  "Set  Parameters"  menu. 

5.3.3.  Feature  Extraction  -  The  feature  extraction  contains  threshold  count,  etc.,  edge  detection, 
and  set  parameters.  Fig.  10  shows  the  feature  extraction  pulldown  menus.  The  "Threshold 
Count"  returns  the  number  of  pixels  above  threshold  or  below  threshold.  The  "etc."  reserves  for 
future  extension.  The  "Edge  Detection"  includes  the  Canny,  Frei  and  Chen,  Kirsch,  Marr  Hil¬ 
dreth,  Prewitt,  Roberts  X,  Robinson,  and  Sobel  edge  operators. 

5.3.4.  Filtering  -  The  image  filtering  contains  maximum,  average,  minimum,  cross  median, 
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Fig.  9.  The  enhancement  pulldown  menus. 
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Fig.  10.  The  feature  extraction  pulldown  menus. 


square  median,  cross  mean  trimmed,  square  mean  trimmed,  convolution,  and  Gaussian  smooth¬ 
ing.  Fig.  1 1  shows  the  filtering  pulldown  menus.  The  "Convolution”  menu  displays  a  pulldown 
menu  which  allows  the  interactive  input  of  a  3  x  3  mask.  The  "Gaussian  Smoothing"  menu 
allows  the  input  of  standard  deviation.  The  remaining  filters  allows  the  selection  of  either  3x3 
or  5  x  5  window. 


5.3  J.  Degradation  -  The  image  degradation  contains  random  noise  and  uniform  noise  additions. 
Fig.  12  shows  the  degradation  pulldown  menus.  The  random  noise  corrupts  individual  bits  in  the 
image  with  the  probability  "percentage."  The  "seed"  parameter  initializes  the  random  number 
generator.  The  uniform  noise  is  distributed  uniformly  over  the  range  from  -range  to  +range.  The 
"seed"  parameter  initializes  the  random  number  generator. 

5.3.6.  Statistics  —  The  image  statistics  contains  moment,  standard  deviation,  maximum,  mean 
and  minimum.  Fig.  13  shows  the  statistics  pulldown  menus.  These  buttons  return  the  statistical 
values  measured  for  the  image. 

5.3.7.  Segmentation  -  The  image  segmentation  contains  bilevel  threshold,  half  threshold,  and 
multilevel  threshold.  Fig.  14  shows  the  segmentation  pulldown  menus.  The  "Bilevel  Threshold" 
creates  a  binary  image  by  mapping  all  pixel  values  below  "Threshold"  to  0  and  all  above  to 
MAXPIXEL.  The  "Half  Threshold"  creates  a  grayscale  image  by  mapping  all  pixel  values 
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Fig.  11.  The  filtering  pulldown  menus. 


Fig.  12.  The  degradation  pulldown  menus. 
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Fig.  13.  The  statistics  pulldown  menus. 


above  "Threshold"  to  MAXPIXEL  and  all  below  to  their  original  values.  The  "Multilevel  Thres¬ 
hold"  creates  a  four-level  image  by  selecting  three  threshold  values  and  mapping  all  pixels 
values  in  between  to  0,  85,  170,  and  255,  respectively. 

iiieset  ISreetwl^ 

MtiRBeveJTlirisshold 

Fig.  14.  The  segmentation  pulldown  menus. 

5-2-8.  Morphology  -  The  mathematical  morphology  contains  binary  dilation,  erosion,  opening, 
closing,  soft  dilation,  soft  erosion,  soft  opening,  soft  closing,  set  parameters,  grayscale  dilation, 
and  grayscale  erosion.  Fig.  15  shows  the  morphology  pulldown  menus.  The  binary 


morphological  opertors  allow  the  selection  ofa3x3or5x5  flat  structuring  element.  The  soft 
morphological  operators  besides  the  same  selection  have  the  input  "rank"  set  up  in  the  "Set 
Parameters"  menu.  The  grayscale  morphological  operators  allow  the  input  values  of  a  3  x  3 
structuring  element. 


Fig.  15.  The  morphology  pulldown  menus. 

6.  Conclusions 

In  the  1995  AFOSR  summer  faculty  research  program,  we  have  accomplished  the  graphical 
user  interface  programming  for  IE2000  Imagery  Toolkit  and  the  extensive  routines  of  image 
morphological  operators  and  enhancement.  We  have  demonstated  that  the  toolkit  is  very  easy  to 
upgrade  and  modify.  The  future  extension  will  have  to  be  object  oriented  in  order  to  take  advan¬ 
tage  of  inheritance,  polymorphism  and  encapsulation.  The  language  of  choice  is  C++.  The  pro¬ 
gram  will  not  have  to  be  rewritten  from  scratch,  many  of  the  current  routines  can  be  modified  to 
be  useful.  The  overall  skeleton  of  the  program  will  be  maintained. 

We  have  investigated  a  new  image  compression  scheme  for  multiresolution  coding  algo¬ 
rithm  by  wavelet  transform  coding.  It  is  observed  that  the  energy  of  high  subbands  are  mainly 
concentrated  around  the  appropriate  edges  of  the  original  image.  A  simple  direct  vector  quanti¬ 
zation  to  encode  the  high  subbands  does  not  take  full  use  of  the  sparsely  distributed  nature  of  the 
high  subband,  and  waste  many  bits  to  encode  the  blocks  with  very  low  variance.  We  apply  a 
block  variance  detector  on  the  lowest  subband  to  predict  whether  the  high  subbands  contain  edge 
or  not.  We  have  also  experimented  with  the  wavelet  transform  combined  with  the  morphological 
close  minus  open  operation  to  extract  dark  or  bright  targets  compared  to  the  intensity  of  back¬ 
ground. 
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Connecting  CASE  to  Simulation  Development 
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Associate  Professor 
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Abstract: 

Object-Oriented  Computer-Aided  Software  Engineering  (CASE)  tools  are  needed  in  the 
process  of  simulation  development.  Simulation  development  is  similar  to  other  forms  of 
software  development,  though  there  are  differences.  We  have  developed  an  automated 
connection  of  Object-Oriented  CASE  software  development  tools  to  simulation 
development.  Specifically,  we  have  evolved  an  environment  in  which  the  Object-Oriented 
CASE  tool  ROSE™  is  connected  to  the  Object-Oriented  simulation  development 
environment  MODSIM™.  With  this  connection,  we  can  use  the  outputs  from  the  CASE 
tool  to  define  classes,  objects,  and  relationships  and  to  suggest  the  modularity  of  the 
simulation  system.  We  can  provide  (stubs  of)  code  from  the  CASE  tool  to  directly  aid 
coding  in  the  simulation  development  environment. 

Ideally,  the  connection  runs  both  ways,  producing  code  from  CASE  files  and  CASE  files 
from  code.  This  bidirectionality  keeps  the  implementation  and  documentation  of  the  class 
and  object  structures  in  step,  we  have  made  some  progress  towards  this  Round  Trip 
connection. 
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Connecting  CASE  to  Simulation  Development 


Jeffrey  W  Smith 


I.  Introduction: 

Software  development  is  an  exacting  task,  and  one  to  which  software  tools  have  been  late 
in  arriving.  Tools  for  Computer-Aided  Software  Engineering  (CASE),  programs  to  help  in 
the  development  of  programs,  have  contributed  to  progress  in  these  areas  of  software 
systems  development.  Software  tools  can  aid  in  the  traceability  of  design  and  capture  of 
design  assumptions,  requirements,  and  constraints  that  is  required  to  perform  the  validation 
and  verification  of  complex  systems.  There  is  increased  emphasis  on  the  early  stages  of 
system  definition  due  to  the  expense  of  missteps  in  this  phase  of  systems  development,  the 
advent  of  object-oriented  technology,  and  decreased  time  to  market  [TP83].  A  simulation 
development  is  much  like  any  other  software  development,  but  there  are  differences.  To 
develop  a  simulation,  as  with  any  software,  data  structures  must  be  defined,  related,  and 
communicated  and  procedural  steps  must  be  developed,  arranged,  and  ordered.  CASE 
tools  which  help  these  operations  can  be  applied  to  this  field.  However,  there  is  no  CASE 
tool  at  present  directed  at  the  specifics  of  simulation  development,  some  extension  or 
adaptation  of  CASE  tools  is  necessary  to  apply  them  to  the  simulation  development 
environment. 

In  a  recent  simulation  development  [BDA95a,  BDA95b,  Min92]  sponsored  by  the  Rome 
Laboratory  and  performed  by  Nichols  Research  Corp.  (NRC),  a  process  simulation  was 
developed  in  MODSIM,  a  simulation  language  derived  from  MODULA-2.  As  a  part  of 
their  study,  NRC  prespecified  the  simulation  objects  and  classes  in  the  CASE  tool  ROSE 
from  Rational  Corp.  Once  the  simulation  was  specified  in  ROSE,  the  specification  was 
passed  to  the  traditional  development  process,  and  any  further  benefits  from  the  definition 
phase  were  largely  foregone  as  a  functional  separation  developed  between  the  definition 
and  the  progressing  implementation.  This  gap  will  appear  as  the  system  is  developed  since 
there  are  updates  and  refinements  to  the  function  of  the  system  as  the  implementation 
proceeds,  and  these  are  seldom  reflected  in  updated  and  refined  defining  documents. 

The  gap  between  the  CASE  tool  and  the  software  development  environment  can  be  bridged 
by  an  automated  tool  that  connects  the  definition  domain  (of  the  CASE  tool)  with  the 
implementation  domain  (of  the  simulation  system).  The  bridge  is  formed  by  reading  in  a 
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portable  ASCII  file  format  output  by  the  CASE  tool,  analyzing  it  syntactically  and 
extracting  semantic  information  from  it  and  using  this  to  output  information  useful  in 
simulation  development.  This  syntactic  analysis  has  proceeded  by  the  process  or  creating 
or  adapting  a  standard-format  grammatical  notation  and  system-standard  language  analysis 
tools  (lex  and  yacc  [JL78]  in  this  case)  to  the  analysis  process.  There  is  some  exemplary 
material  in  this  report  on  how  the  bridge  was  built,  and  examples  from  each  of  the  analysis 
domains  is  included.  This  information  can  be  used  to  generate  the  definition  modules  and 
implementation  module  headers  for  the  MODSIM  simulation  system,  which  serves  to 
define  the  data  structures  and  their  relationships  for  the  development  of  simulations. 

The  structure  of  the  translation  system  which  connects  the  CASE  environment  (ROSE)  to 
the  simulation  environment  (MODSIM)  is  shown  in  Figure  1.  The  petal  files  which  are 
output  by  ROSE  are  read  in  by  a  program  named  "petal",  which  translates  (portions  of) 
them  to  MODSIM  code.  The  MODSIM  code  can  be  compiled  by  the  MODSIM  compiler 
(mscomp)  and  will  define  (portions  of)  the  simulation.  The  MODSIM  code  can  also  be 
accepted  by  a  program  called  "simula",  which  outputs  a  file  that  can  be  reintroduced  into 
the  CASE  environment.  Due  to  the  scale  of  the  translation  problem,  compiler  development 
tools  (lex  and  yacc  [JL78])  were  used  to  develop  them.  These  tools  and  their  application  to 
the  CASE-to-simulation  connection  are  detailed  below.  Keep  in  mind  in  this  discussion 
that  the  parser  is  the  program  that  does  the  translation.  When  we  define  a  (lexical  analyzer 
and)  grammar,  we  imply  a  parser  which  yacc  will  build.  We  then  run  this  parser  to  do  the 
translation. 

II.  Software  Definition. 

CASE  Tools: 

A  CASE  tool  is  any  program  that  helps  in  software  development,  modification,  or 
maintenance.  This  takes  in  a  lot  of  territory,  but  we  are  most  interested  in  the 
definition/specification  stage  that  occurs  at  the  very  beginning  of  the  process.  With  the 
advent  of  the  object-oriented  approach  to  software  definition  and  development,  there  has 
been  a  corresponding  development  of  object-oriented  specification  and  CASE  tools. 

The  use  of  CASE  tools  for  system  definition  and  initiation  of  development  is  an  idea  whose 
time  has  come.  CACI  has  a  graphics-centered  definition  environment  (SIMOBJECT)  and 
NRC  also  uses  one  (GENESys).  We  will  use  a  more  general  tool  for  Object-Oriented 
definition  (ROSE). 
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Figure  1 
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A  tool  that  can  actually  be  used  to  reduce  the  developer's  workload  or  make  him  more 
productive  is  what  is  needed  in  the  software  development  process.  Once  we  have  found  a 
CASE  tool  with  desirable  properties,  we  want  to  use  it  as  extensively  as  possible.  The 
definition  phase  of  software  system  development  is  an  upstream  process  in  the  sense  that  it 
has  leverage  on  the  subsequent  development  of  the  system.  It  can  (and  should)  also  serve 
as  an  overview,  a  document,  and  an  education  asset  for  the  system  throughout  its  lifetime. 
To  realize  these  benefits,  the  definition  information  must  stay  in  contact  with  the 
implementation. 

The  ROSE  CASE  Tool: 

Rational  Object-oriented  Systems  Engineering  (ROSE)  is  an  approach  and  a  software 
methodology  implemented  to  support  object-oriented  analysis  and  design  [Boo91,  Whi94a, 
Whi94b],  team  development,  and  code  generation  and  analysis  in  C++  and  Ada.  ROSE 
runs  on  the  PC  and  compatibles  with  Windows  3.1+,  and  does  have  a  code  generation 
capability,  in  C++  [Rat94],  Such  a  code  generation  capability  may  be  labor-saving  in 
development,  but  it  may  be  more  important  as  a  component  in  the  Round  Trip  concept 
[Boo91,  Whi94a,b],  since  the  generated  code  produces  the  defining  function  headers  and 
data  structure  boilerplate  and  documentary  annotations  that  make  the  Reverse  Engineering 
portion  of  the  cycle  possible  without  loss  of  information.  (In  Reverse  Engineering,  code 
which  may  have  been  updated  in  implementation  is  read  back  into  the  CASE  tool  to  provide 
an  updated  definition  and  documentation  for  the  system.)  The  code  generated  would  be 
directly  applicable  to  simulation  if  the  simulation  were  done  in  C++,  or  indirectly  applicable 
if  we  choose  to  translate  from  the  C++  stubs  produced  by  the  CASE  tool  to  the  simulation 
language  (MODSIM  in  our  case).  Instead,  we  have  chosen  to  make  the  translation  from  the 
ROSE  portable  ASCII  file  format  (the  petal  format)  to  MODSIM,  and  in  the  process  to 
learn  how  to  approach  the  translation  problem  so  as  to  be  able  to  provide  a  general 
connection  between  CASE  and  simulation,  which  will  bring  the  benefits  of  documentation, 
requirements  and  constraint  statements,  and  validation  capability  to  the  simulation 
development  environment. 
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Figure  2 

The  ROSE  approach  builds  a  definition  of  the  system  from  4  models  which  can  be 
specified:  logical,  physical,  static,  and  dynamic.  The  ROSE  user  creates  and  refines  these 
4  models  of  his  system  within  the  framework  of  a  general  model  representing  the  problem 
domain  and  the  software  system,  including  classes,  class  categories,  objects,  operations, 
states,  subsystems,  modules,  processors,  devices,  and  relations  between  them  [Boo91]. 
Icons  that  represent  these  aspects  of  the  model  appear  where  appropriate  in  the  ROSE  GUI, 
allowing  a  pictorial  view  and  manipulation  of  the  model's  application,  diagram,  and 
specification  aspects.  This  information  would  be  useful  in  implementation  in  addition  to 
the  planning  of  the  system;  it  is  presented  in  three  formats:  the  graphical  format  which  the 
GUI  provides  to  the  user  for  entry  and  manipulation,  a  printed  template  format,  and  the 
files  (called  petal  files)  that  the  system  uses  to  save  the  work  on  the  disk  between  sessions. 
The  petal  files  are  also  the  portable  format  of  the  system  information. 

III.  Software  Implementation. 

Simulation  Development 

As  the  field  of  computer  science  advances,  it  becomes  more  specialized  and  fragmented. 
The  simulation  community  is  one  such  fragment  -  simulation  developers  use  their  own 
languages  (SIMSCRIPT,  SIMULA,  GASP,  GPSS,  SLAM,  MODSIM,  etc.)  and  their  own 
development  environments  especially  tailored  to  the  simulation  task.  However,  there 
would  be  benefit  from  access  to  the  more  extensive  and  capable  development  tools  available 
in  the  mainstream  development  environment. 
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As  code  development  proceeds,  implementation  concerns  are  paramount,  with  an  inevitable 
diminishment  in  the  attention  paid  to  definition.  But  the  definition  remains  very  important, 
and  any  gap  between  the  definition  and  the  implementation  must  be  viewed  with  alarm.  We 
need  automated  tools  that  can  either/both: 

1 .  Produce  useful  implementation  code  from  the  definition  phase 

2.  Produce  an  updated  definition  from  a  changed  implementation. 

Since  the  definition  of  the  system  includes  the  most  accessible  documentation  of  the 
system,  useful  for  understanding  the  system,  training,  etc.  it  is  important  to  keep  this  up- 
to-date.  The  basic  idea  of  Round  Trip  Gestalt  Design  (notated  as  RTE  for  Round  Trip 
Engineering)  [Boo91,  Whi94a,  Whi94b]  is  that  no  development  effort  should  ever  be 
wasted,  so  all  the  work  that  is  done  must  be  applicable  to  the  final  system  design. 

However,  we  want  a  solution  (at  least  a  prototype  of  a  solution)  as  soon  as  possible.  In 
this  model  of  development,  progress  is  made  with  each  lap  around  the  cycle,  but  it  is 
important  to  have  the  capability  to  make  a  lap  around  the  cycle. 

In  order  to  do  the  RTE,  the  capability  to  translate  reliably  and  usefully  from  the  definition 
space  into  the  implementation  space.  For  instance,  the  ability  to  produce  class  definitions 
and  object  headers  from  the  definition  code  make  a  useful  connection;  if  this  is  well  done, 
changes  to  the  class  structure  can  be  done  in  the  definition  domain  rather  than  the 
implementation  domain;  which  would  serve  as  an  immediate  bridge  over  the  definition- 
implementation  gap. 

The  MODSIM  Development  Environment: 

MODSIM  II  is  a  general-purpose  modular  language  based  on  MODULA-2  and  with 
extensions  that  make  it  particularly  suitable  for  simulation  development  [MOD93a, 
MOD93b,  MOD94],  MODSIM  is  the  centerpiece  of  a  group  of  simulation  products  that 
extend  and  enhance  the  simulation  environment  (e.g.,  SIMGRAPHICS  II,  SIMMASTER, 
SIMOBJECT).  MODSIM  is  a  very  capable  simulation  development  environment  which 
has  been  used  to  produce  many  large  and  capable  simulations.  For  instance,  MODSIM  has 
been  used  to  simulate  the  traffic  flow  about  the  Chunnel,  and  is  nearly  a  standard  in  the 
simulation  of  airport,  air  traffic,  and  military  operations  [OP95]. 

IV.  Translation  Example. 
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If  we  want  to  translate  from  one  environment  to  another,  the  level  at  which  the  translation 
should  occur  is  the  first  issue  to  resolve.  If  both  the  CASE  tool  and  the  simulation  system 
have  a  GUI,  it  is  tempting  to  attempt  a  connection  at  this  level,  but  unless  the  GUI's  are 
identical  we  do  not  have  the  technology  for  this  level  of  translation.  If  both  have  templates, 
the  corresponding  fields  can  be  filled  in  to  provide  a  translation  at  this  level.  In  our  case, 
ROSE  has  both  a  GUI  and  templates  but  MODSIM  has  neither  presently  available  with  the 
language  product.  ROSE  also  has  files  (the  petal  files)  which  provide  the  ASCII  portable 
statement  of  the  CASE  tool  output,  and  MODSIM  has  a  well-defined  language  which  it 
accepts.  For  reasons  of  available  tools  and  definitions,  the  file-to-language  level  of 
translation  is  the  approach  we  have  adopted. 

Background  on  Grammars: 

"A  programming  language  can  be  defined  by  describing  what  its  programs  look  like  (the 
syntax  of  the  language)  and  what  its  programs  mean  (the  semantics  of  the  language) 
[ASU88,  p.25].  This  statement  is  also  true  of  files,  datasets,  network  transactions,  indeed 
most  computational  entities.  "For  specifying  the  syntax  of  a  language,  we  present  a  widely 
used  notation,  called  context-free  grammars  or  BNF  (for  Backus-Naur  Form)....  Besides 
specifying  the  syntax  of  a  language,  a  context-free  grammar  can  be  used  to  help  guide  the 
translation  of  programs...  known  as  syntax-directed  translation."  [ASU88,  p.25] 

A  grammar  is  a  formal  set  of  rules  and  procedures  for  the  analysis  or  synthesis  of  language 
constructs.  The  rules  are  called  productions,  and  have  the  form 
LHS  — >  RHS 

which  we  read  as  'the  left-hand  side  generates  the  right-hand  side',  or  'the  right-hand  side 
can  be  generated  by  the  left-hand  side'.  The  grammar  has  terminal  symbols  that  actually 
appear  in  the  language,  and  nonterminal  symbols  that  represent  intermediate  steps  in 
grammatical  analysis.  The  terminal  symbols  are  grouped  for  analytical  purposes  into 
lexical  tokens.  A  grammar  generates  member  strings  of  a  language  by  starting  at  the  start 
symbol  and  applying  productions  (replacing  the  LHS  with  the  RHS  until  there  are  only 
terminals  left.  This  process  is  said  to  work  top-down.  A  grammar  can  accept  (reject) 
strings  by  replacing  terminals  with  tokens,  then  replacing  RHS  of  productions  with  LHS 
until  only  the  start  symbol  is  left.  This  process  is  said  to  work  bottom-up.  Top-down 
analysis  works  out  readily  by  hand  and  appears  to  be  more  elegant,  but  bottom-up  methods 
apply  to  a  larger  class  of  grammars  and  are  more  amenable  to  computer  implementation. 
For  practical  problems,  the  describing  grammars  can  become  very  large  and  computers 
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must  be  used  to  process  them  (analyze,  check  for  conflicts,  reduce).  The  UNIX-based 
tools  lex  and  yacc  [JL78,  SF85,  ASU88,  Pys88]  have  become  a  standard  in  this  area,  and 
are  the  tools  we  made  use  of  for  this  translation  project.  When  a  defining  grammar  is  input 
into  the  computer  in  yacc  format,  the  program  will  build  a  parser  for  the  defined  language. 
There  are  frequently  performance  or  size  difficulties  with  this  very  general  definition,  but 
we  have  neglected  such  issues  for  this  prototype  development. 

Using  grammatical  analysis  on  linguistic  entities,  there  are  several  technical  issues  which 
must  be  addressed  -  ambiguity,  associativity  of  operators,  precedence,  etc.  Here  is  a  very 
small  example  grammar  for  a  subset  of  arithmetic  expressions: 

expr  -->  expr  +  term  I  I  expr  -  term  I  I  term 

term  -->  term  *  factor  I  I  term  /  factor  I  I  factor 

factor  -->  number  I  I  (  expr  ) 

In  this  example  grammar,  all  the  symbols  mentioned  are  nonterminals  except  for  ()+-*/. 
This  grammar  incorporates  the  precedence  of  */  over  +-  and  nested  expressions  using  (). 

The  process  of  analyzing  an  input  stream  to  determine  whether  or  not  it  is  a  member  of  the 
set  acceptable  to  the  grammar  is  called  parsing.  "A  parser  can  be  constructed  for  any 
grammar...  For  any  context-free  grammar,  there  is  a  parser  that  will  parse  a  string  of  N 
•  tokens  in  0(NA3)."  [ASU88,  p.48].  We  need  to  do  more  than  analyze  the  input,  we  need 
to  know  what  it  means  so  that  we  can  translate  it  into  another  format.  As  the  symbols  are 
encountered  in  the  parsing  process,  they  can  be  viewed  as  having  certain  attributes  (this 
word  is  the  accepted  terminology,  but  an  attribute  is  not  crisply  defined,  it  is  anything  that 
aids  the  translation  process).  The  grammar  with  attributes  attached  to  its  symbols  is  called 
an  attributed  grammar ,  which  can  be  used  in  the  process  of  syntax-directed  translation 
[ASU88,  p.34].  Here  is  a  part  of  the  grammar  above  with  attributes  attached: 


expr 

- > 

expr 

+ 

term 

expr . a  =  expr . a 

|  term. a  1 

'  + 

expr 

expr 

- 

term 

expr. a  =  expr. a 

I  term. a  1 

1  _ 

expr 

~  ~  > 

term 

expr. a  =  term. a 

term 

0  I  1 

1 

II  ... 

term. a  =  'O'  or 

'  1 ' 

This  grammatical  stub  shows  an  attributed  grammar  describing  a  program  that  will  convert 
an  arithmetic  expression  in  infix  notation  (3  +  4)  to  one  in  postfix  notation  (3  4  +).  Postfix 
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notation  is  useful  in  computers  and  calculators  for  arithmetic  since  expressions  can  be 
written  unambiguously  without  parentheses  and  because  they  can  be  evaluated  with  a  single 
operand  stack.  The  first  rule  states  that  if  the  production  (expr  +  term)  is  invoked,  the 
output  should  be  (expr  term  +).  Figure  3  shows  the  lex  and  yacc  files  for  a  complete 
implementation  of  this  function  (this  example  was  adapted  from  one  in  [ASU88]),  followed 
by  example  input  and  output  illustrating  the  operation  of  this  program.  This  method  of 
parser  development  (given  a  grammar)  is  well  documented  [JL78,  ASU88,  Pys88]. 

The  size  of  the  parse  produced  from  the  lexical  and  grammatical  specification  of  toy  is 
mentioned  here  for  purposes  of  comparison  with  the  later  parsers  we  developed: 

The  iex  portion  has  12  nodes,  29  partitions;  the  yacc  portion  has  9  terminals,  3 
nonterminals,  13  grammar  rules,  26  states. 

ROSE  Information  for  MODSIM  Development: 

As  objects  and  classes  are  developed,  they  do  serve  as  a  guide  and  template  for  further 
system  development.  Properly  done,  the  classes,  inheritance  and  relation  structures,  and 
objects  that  result  from  object-oriented  CASE  development  will  be  the  framework  for  all 
subsequent  development,  will  be  the  overview  of  the  system,  and  will  provide  an  education 
and  training  tool  over  the  lifetime  of  the  system.  At  least,  that  is  the  idea. 

In  practice,  once  the  classes  are  defined  and  their  relationships  clearly  depicted  in  whatever 
sort  of  diagram  the  developers  may  favor,  the  pressures  of  performance,  function,  and  time 
affect  the  structure  and  blur  the  formal  lines  of  definition.  The  major  goal  of  Round  Trip 
Engineering  is  to  prevent  this  gap  from  developing  by  keeping  the  initial  phases  of  system 
development  in  close  contact  with  subsequent  implementation. 
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/*  FILE  =  toy.l  jws:950821  */ 
%{ 

#include  "y.tab.h" 

extern  int  yylval;  /*  jws  */ 

% } 

/*  FILE  =  toy.l  jws : 950821  */ 


[  \t\n]  ;  /*  white  space  */ 

[0-9]+  {yylval=atoi (yytext ) ;  return ( INTEGER) ; } 

{return (yytext [0] ); }  /*  other  chars  */ 


/*  FILE  =  toy.y  jws:950821  */ 

/*  produce  lukasciewicz  (postfix)  expression  from  infix  */ 
%token  INTEGER 
%{ 

#include  "express. h" 
extern  char  yytext []; 

%} 


expr:  term 


1 

expr  ' + ' 

term 

{print f ( " 

+ 

" )  ; } 

1 

term: 

expr  1  -  ' 
factor 

term 

{printf ( " 

" )  ; } 

1 

term  1  * ' 

factor 

{printf ( " 

* 

" )  ; } 

1 

factor : 

term  ' / ' 

1  (  '  expr 

factor 
'  )  ' 

{printf ( " 

/ 

" )  ; } 

INTEGER 

{printf ( " 

%d 

", yylval) ; } 

1 + '  INTEGER 

{printf ( " 

%d 

" , yylval) ; } 

' - '  INTEGER 
' + '  ' ( ’  expr 

'  )  ' 

{printf ( " 

%d 

" , -yylval) ; } 

' - '  1 ( '  expr 

'  )  ' 

{printf ( " 

0- 

");}  ; 

%% 

int  yywrap () {return { 1 ); } 


#  sample  input /output  from  the  above 
lex. yy . c : if (yywrap () )  return(O);  break; 
toy : yywrap 


smith jw/YACC : 

echo 

3+4  I  toy 

3 

4  + 

smithjw/YACC : 

echo 

3+ (4+2)  I  toy 

Badly 

placed 

( )  '  s  . 

smithjw/YACC : 

echo 

"  3+  (4+2 ) "  1  toy 

3 

4  2  + 

+ 

smithjw/YACC : 

echo 

" (3+4)  +2  "  1  toy 

3 

4  +  2 

+ 

smithjw/YACC : 

echo 

"  -  (3+4)  +  2  "  I 

toy 

3 

4  +  2 

+ 

smithjw/YACC : 

echo 

"  -  (  3  +  4)  +  2  "  1 

toy 

3 

4  +  - 

2 

smithjw/YACC : 

echo 

"  -  (3  +  4)  +  2  "  1 

toy 

3 

4  +  0 

-  2  + 

smithjw/YACC : 

echo 

"  +  3  "  1  toy 

3 

smithjw/YACC : 

echo 

"  (3+4)  +2"  I  toy 

3 

4  +  2 

+ 

smith jw/YACC : 

echo 

"  3  *  (4  -  2 )  /  5 "  1  toy  3  4  2 

- 

*  5  / 

smithjw/YACC : 

echo 

"  -3  *  ( 4-2 ) /5 "  1  toy 

-3 

4  2 

-  *  5 

/ 


Figure  3 
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Connecting  ROSE  to  MODSIM: 

The  last  thing  a  person  needs  when  dealing  with  a  system  under  development  (get  it  to 
work,  explore  implementation  alternatives,  get  it  to  work  faster,  get  it  to  work  again)  is  any 
hitch  in  the  development  tools  that  distract  from  or  even  obscure  the  real  problem.  In  this 
context,  a  'seam'  in  the  development  environment  is  any  noticeable  break  in  the  uniformity 
of  the  process.  For  (specific)  example,  suppose  that  there  is  a  manual  procedure  that  must 
be  performed  in  the  midst  of  an  otherwise  automated  set  of  procedures.  The  fact  that  the 
manual  procedure  is  slower  and  less  convenient  will  serve  to  discourage  its  use.  If  the 
manual  procedure  is  a  part  of  a  development  process,  then  the  development  process  is 
slowed  or  undone  to  the  extent  that  the  procedure  is  distracting,  more  effort  (or  perceived 
as  such),  more  time  consuming,  and  more  error-prone.  'Seamless'  development  lacks 
discontinuities  in  the  process.  The  key  to  seamless  development  is  automation:  end-to-end, 
robust,  complete.  This  is  what  we  want,  as  does  anyone  facing  similar  problems  (for 
instance,  the  CAD  community). 

Once  the  files  have  been  captured,  those  portions  which  do  not  apply  to  the  software 
development  can  be  dropped  from  consideration.  For  instance,  there  are  portions  that  deal 
with  the  position  and  placement  of  the  class  and  object  representation  in  the  ROSE  GUI. 
While  this  information  is  valuable  when  working  with  the  ROSE  environment,  there  is  no 
need  to  transfer  it  to  the  MODSIM  environment. 

There  are  many  tools  which  aid  in  the  structural  analysis  of  files,  programs,  or  other 
computer  artifacts  which  have  structure.  These  tools,  of  which  lex  and  yacc  are  exemplary 
[JL78,  SF85,  Pys88],  require  a  grammar  as  input  to  specify  the  structure.  The  question 
then  becomes:  how  can  one  find  or  create  a  grammar  for  a  particular  structure  one 
is  trying  to  analyze?  There  are  two  ways  to  do  this,  with  compromises  and  variations 
possible: 

1.  Evolve  a  grammar  from  an  example  set  of  the  structures.  This  will  result  in  an  ad  hoc 
grammar,  since  it  will  describe  only  the  set  of  examples  for  which  it  was  developed,  but  if 
done  in  the  proper  spirit  (as  generally  as  possible  while  getting  the  problem  done),  a 
workable  grammar  can  be  produced  in  this  way. 

2.  If  the  structure  were  generated  in  compliance  with  some  grammar,  then  that  grammar 
will  also  serve  to  analyze  the  structure. 

Each  of  these  methods  was  used  to  develop  a  grammar  in  the  analysis  of  the  ROSE  (the 
petal  parser  developed  with  the  first  method)  and  MODSIM  (the  Simula  parser  developed 
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with  the  second  method)  structures  that  served  as  the  foundation  for  the  CASE-simulation 
connection. 

Inventing  a  Grammar: 

Suppose  that  you  have  no  structural  description  at  all,  but  suspect  (or  know)  that  there  is 
structure  present.  How  can  you  proceed  to  a  grammar  for  the  structure?  The 
procedure  is  to  develop  a  system  which  is  as  general  as  possible  and  obeys  certain  general 
principles  of  operation,  but  also  does  the  desired  example  as  a  special  case.  The  "does  the 
example"  part  will  accomplish  the  immediate  function.  The  "as  general  as  possible"  part 
will  provide  a  foundation  for  further  development  and  extensions. 

A  grammar  for  the  ROSE  files  was  developed  following  this  procedure.  The  files  from  the 
CASE  specification  of  the  BDASIM  simulation  provided  the  training  set  used  to  develop 
the  grammatical  rules.  Starting  with  the  shortest  file  (the  one  that  needed  the  fewest  rules  to 
analyze),  a  grammar  for  the  acceptance  of  ROSE  files  was  developed  by  the  addition  or 
modification  of  rules  in  the  grammar  as  cases  arose.  This  procedure  is  straightforward,  but 
it  is  laborious  and  can  become  intricate  as  the  size  and  complexity  of  the  grammar  increase. 
Also,  it  is  not  straightforward  to  be  complete  and  elegant  and  meaningful  and  semantic  all  at 
once.  As  a  general  course,  this  method  should  be  avoided  since  the  amount  of  work  to  an 
operational  grammar  is  unknown  a  priori.  However,  it  can  be  made  to  work  when  there  is 
no  alternative.  In  this  case,  the  ROSE  files  were  all  recognized  with  the  parser  "petal". 

The  file  set  consisted  of  22  .cat  files  (9094  lines)  and  2  .mdl  files  (934  lines)  for  a  total  of 
10028  lines.  The  .cat  (category)  files  and  .mdl  (model)  files  are  both  in  .ptl  (petal)  file 
format,  but  their  contents  differ.  The  grammar  went  through  14  versions  in  evolution 
(though  the  last  two  were  for  demonstration  purposes),  took  approximately  three  weeks  to 
develop,  and  ended  up  with  439  productions.  These  facts  apply  basically  to  syntactic 
recognition,  since  only  one  example  file  was  worked  through  the  semantic  portion  of  the 
translation.  Unfortunately,  this  process  does  not  guarantee  a  general  grammar  for  the 
ROSE  files,  only  a  grammar  as  general  as  we  could  make  it.  Here  are  summary  statistics 
for  the  petal  grammar  parser  we  developed: 

2277  nodes,  6802  positions 

168  terminals,  209  nonterminals,  439  grammar  rules,  622 
states 

Subsequently,  Rational  sent  us  a  description  of  their  file  reader,  from  which  we  could  infer 
that  they  used  the  yacc  approach  to  a  parser.  Their  grammar  appears  to  be  about  twice  the 
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size  of  "petal",  though  much  of  the  difference  may  come  from  the  fact  that  this  parser  is  for 
a  later  version  of  the  file  format  and  includes  the  code  generation  portions  of  the  file  format 
which  were  absent  from  the  BDASIM  petal  set.  Here  is  a  set  of  summary  statistics  for  the 
petal  grammar  from  Rational  Corp: 

331  terminals,  421  nonterminals,  909  grammar  rules,  1806 
states 

Adapting  a  Grammar: 

The  MODSIM  Reference  Manual  contains  a  BNF-format  grammar  for  the  MODSIM 
language.  This  grammar  carries  no  date  or  version  number  and  it  turns  out  to  be  not  quite 
the  grammar  for  the  language  (statements  which  are  rejected  by  the  grammar  are  accepted 
by  the  compiler.  For  instance,  the  production  (in  BNF  format)  for  a  formal  parameter  list 
is  published  as: 

FormalParameters  -->  (  [  FPSection  {  ;  FPSection  }  ]  )  [  :  ident  ] 

This  statement  includes  the  information  that  a  formal  parameter  list  must  be  encased  in 
parentheses  —  they  are  not  indicated  as  optional.  In  fact  (as  the  compiler  implements  the 
language),  they  are  optional  if  the  list  is  NULL.  There  are  several  divergences  of  this  sort 
between  the  grammar  and  the  compiler.  As  another  instance,  the  specification  of  the 
structure  CLASS  is  not  mentioned  at  all,  so  this  must  be  added  to  the  grammar  (using  the 
ad  hoc  method).)  Nonetheless,  this  grammar  is  close  enough  to  the  current  version  to 
make  a  useful  starting  point  for  grammar  development.  The  BNF  format  is  only  similar  to 
the  yacc  input  format,  so  the  information  must  be  translated  as  well  as  adapted  and 
extended.  The  working  grammar  was  developed  by  elaborating  the  published  grammar  in 
much  the  same  way  as  in  the  ad  hoc  method  used  for  the  petal  grammar.  The  resulting 
parser,  "simula",  accepts  all  65  Implementation  modules  (15862  lines),  all  65  Definition 
modules  (3364  lines)  and  all  7  Main  modules  (1246  lines)  included  in  BDASIM.  The 
Simula  parser  went  through  6  versions  and  took  two  weeks  to  develop.  We  mad  no  attempt 
to  add  semantics  to  this  parser.  Again,  this  does  not  guarantee  that  this  parser  is  general, 
but  since  the  MODSIM  grammar  that  was  used  as  a  starting  point  must  reflect  the  structure 
of  the  language,  it  is  probably  pretty  general. 

To  complete  a  lap  around  the  development  domain,  definition  must  be  extractable  from 
implementation  information.  We  will  have  made  some  progress  in  this  area  with  the 
analysis  of  MODSIM  code  and  the  generation  of  readable  petal  files  from  it.  Here  is  a  set 
of  summary  statistics  for  simula.y: 

666  nodes,  2558  positions 
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95  terminals,  166  nonterminals,  300  grammar  rules,  489  states 

V.  Conclusions  and  Further  Work: 

Conclusions: 

We  have  shown  how  one  specific  CASE  tool  can  be  applied  to  one  specific  simulation 
development.  We  have  redone  exemplary  portions  of  the  simulation  development  to  show 
the  improvement  in  the  development  process  effected  by  the  CASE  tool.  The  Rational 
Corp.  CASE  tool  ROSE  has  been  applied  to  the  software  development  for  the  MODSIM 
development  environment.  We  used  the  UNIX  development  tools  to  translate  the  outputs 
of  the  ROSE  system  for  Object  and  Class  definitions  and  refinements  to  the  inputs  required 
by  the  MODSIM  development  environment  (essentially,  MODSIM  source  code  statements; 
this  is  mostly  a  code  translation  process  at  the  syntactic  level,  but  the  semantics  have  had 
some  influence  as  well). 

Tliis  approach  can  be  made  to  work  (as  the  limited  example  shows),  but  the  semantics  are 
going  to  be  even  harder  than  the  syntax.  Also,  the  ad  hoc  nature  of  the  solution  means  that 
the  very  next  file  may  present  a  new  case  that  causes  the  parser  to  fail.  This  approach  is  not 
robust,  so  it  is  not  a  good  idea  to  use  it  to  build  a  tool  for  the  critical  path  of  system 
development. 

Further  Work: 

To  proceed  to  a  robust  connection  between  ROSE  and  MODSIM: 

1.  Get  the  attributed  petal  grammar  and  associated  lexical  definition  from  Rational. 

2.  Get  the  most  up-to-date  MODSIM  grammar  from  CACI. 

Proceed  to  mate  these.  The  methods  used  in  development  in  this  project  can  serve  as  a 
guide  in  this  procedure,  they  have  been  shown  to  be  effective. 

On  a  more  general  note,  the  idea  of  a  closer  connection  between  the  definition  and 
implementation  phases  of  system  development  must  be  pursued.  A  properly  defined 
system  can  be  prototyped,  have  early  simulation  results,  can  be  connected  to  other  software 
to  preview  system  integration,  has  more  credibility  throughout  the  life  cycle  of  the 
software.  The  benefits  from  formal  definition  are  very  great,  but  it  is  a  lot  of  effort.  If  that 
effort  can  be  made  to  pay  off  by  jump-starting  the  implementation  phase,  it  is  a  winner! 
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Abstract 


Every  time  an  order  of  magnitude  improvement  occurs  in  hardware  speed,  it  requires  a  qualitative  reexami¬ 
nation  of  system  design  and  software  architecture.  Every  major  advance  in  hardware  has  left  software  engi¬ 
neers  and  system  designers  looking  for  operating  system  structures  to  harness  new  capabilities  and  applica¬ 
tions  that  are  in  need  of  increased  performance.  High-Bandwidth  ATM  Networks  are  no  different  than 
other  major  advances  in  this  respect.  This  study  examines  low  level  system  support  for  ATM  networks.  In 
addition,  experimental  measurements  show  that  harnessing  the  potential  of  ATM  in  existing  applications 
requires  appropriate  low-level  system  support.  Two  applications  are  used  as  examples  where  the  benefits  of 
ATM  can  be  immediately  applied. 
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Application  of  ATM  Networks  in  Distributed  Systems 


Scott  Spetka 


Introduction 

We  begin  by  presenting  performance  measurements  that  indicate  the  potential  performance  advantage  of 
ATM  and  the  importance  of  appropriate  software  interfaces  to  exploit  improved  speed.  In  addition,  as  we 
look  into  the  future  of  this  technology,  we  also  consider  performance  measurements  which  will  help  us  to 
gauge  the  potential  increased  demand  for  this  ATM,  to  support  the  types  of  applications  that  we  propose. 

One  application  that  already  suffers  from  an  acute  shortage  of  bandwidth  is  the  Internet-based  world-wide 
digital  broadcast  system.  Mbone  (the  multicast  backbone)  [http://www.best.com/~prince/techinfo]  uses  a 
tree-structure  to  broadcast  real-time  audio,  video,  and  other  multimedia  presentations.  To  use  network 
bandwidth  efficiently,  the  Mbone  multicast  routers  form  a  store-and-forward  tree,  using  the  underlying 
Internet  for  communications  between  mrouters  (see  Figure  1).  Mbone  users  have  kernel  support  for  receiv¬ 
ing  broadcast  packets  directly  from  an  mrouter,  through  an  ethernet  interface. 


21-3 


World-wide  distribution  of  data  has  grown  exponentially  over  recent  years  with  the  World-Wide  Web  as  the 
primary  tool  for  access.  Some  automation  of  the  browsing  process,  using  indexes  that  can  be  built  periodi¬ 
cally  and  active  robots  [http://www.cs.sunyit.edu/Robots/tkwww.html]  gives  users  the  capability  to  initiate 
a  WWW  query  that  may  require  access  to  distributed  data  resources.  The  WWW  is  increasingly  being  used 
for  access  to  databases.  Interfaces  like  GSQL  [ftp://ftp.ncsa.uiuc.edu/Web/tools/gsql]  are  used  for  access  to 
relational  databases  through  WWW  browsers.  Increased  availability  of  Databases  and  their  usage  is  clearly 
related  to  the  drop  in  price  of  disk  storage  from  $  1.00/MB  two  years  ago  to  around  $. 20/MB  today.  The 
increase  in  access  of  globally  distributed  information  will  require  increased  use  of  high-bandwidth  net¬ 
works  to  support  the  growing  demand. 
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Performance  Measurements 


A  performance  study  compared  two  protocols  used  to  access  the  ATM  network  interfaces,  tcp-ip  and  the 
ATM  application  interface  (API).  Table  1  shows  the  difference  in  performance  that  was  measured  for  a  tcp- 
ip  connection  to  an  ATM  interface  and  a  connection  made  through  the  ATM  API.  The  measurements  show 
that  the  ATM  API  provides  a  significant  improvement  over  tcp-ip.  The  measurements  were  made  using  the 
netperf  program  [http://www.cup.hp.com/netperf/NetperfPage.html].  Performance  experiments  were  con¬ 
ducted  using  a  FORE  ATM  switch  with  Taxi  interface  cards  in  each  of  two  systems  communicating  through 
the  switch  (a  Sparc  20  and  an  HP  755). 

The  tcp-ip  protocol  provides  for  reliable  and  robust  communication  through  a  relatively  unreliable  network 
where  congestion  and  failures  can  require  that  each  packet,  even  in  a  connection  oriented  protocol,  may 
have  to  take  a  different  path  through  the  network.  Packet  reassembly  and  retransmission  requests  are  sup¬ 
ported  well  by  this  protocol.  The  overhead  for  this  processing  was  small,  compared  to  slow  network  speeds 
when  networks  were  limited  to  10  Mbits/sec.  The  functionality  of  tcp-ip  is  necessary  for  unreliable  net¬ 
works. 

Protocols  for  ATM  networks  assume  the  reliability  of  the  switches  on  a  communication  path.  Advance 
reservation  of  resources  allows  a  guaranteed  availability  of  resources  to  satisfy  the  needs  of  each  applica¬ 
tion.  This  closer  relationship  between  the  application  and  the  ATM  API  allows  a  reduction  in  overhead  and 
reservation  of  appropriate  network  resources  to  support  high-speed  communication. 
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Benchmark 

Test 

Message 
Size  (bytes) 

Throughput 

10'6bits/sec 

TCP  STREAM  TEST 

16384 

38.88 

FORE  UNIDIRECTIONAL  SEND  TEST 

4096 

110.88 

Table  1  -  Comparison  of  TCP-IP  and  ATM/API 

It  is  also  important  to  consider  the  future  demand  for  ATM  networks.  A  486Dx2/66  processor  is  not  capa¬ 
ble  of  driving  an  ethemet  interface  at  more  than  5Mbits/sec.  under  the  BSD/OS  Unix  system 
[http://www.bsdi.com].  However,  the  table  below  uses  standard  benchmarking  programs 
[ftp://ftp.cs.sunyit.edu/pub/BENCH]  to  demonstrate  that  a  modern  PC  running  BSD/OS  can  perform  com¬ 
parably  to  an  HP-755.  This  increase  in  processor  performance  for  inexpensive  PCs  is  bound  to  result  in 
increased  demand  for  high-speed  applications.  Software  needs  to  be  developed  to  allow  the  global  network 
to  meet  the  expected  increase  in  demand  from  this  market. 


Benchmark 

HP-755 

P-120  Pentium 

MFLOPS 

13 

9 

Dhrystones 

55,000 

125,000 

Table  2  -  Comparison  of  Unix  Workstation  and  Modern  PC 
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Conclusion 


This  report  shows  the  potential  for  improved  application  performance  through  use  of  ATM  networks  when 
application  software  is  adapted  to  use  the  ATM  application  interface  and  avoid  the  overhead  for  TCP/IP 
support  of  network  reliability.  Additional  experiments  need  to  be  performed  in  videoconferencing  applica¬ 
tions  and  distributed  database  applications  that  are  described  above. 
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ABSTRACT 


Intrasubband  and  intersubband  scattering  rates  of  heavy  holes  are  obtained  due  to 
confined  nonpolar  optical  phonons  in  a  Sii_rGer  quantum  well  with  Si  barriers.  Guided 
and  interface  Ge-Si  and  Ge-Ge  modes  and  unconfined  Si-Si  modes  are  considered.  A 
continuum  model  is  used  for  the  two  components  of  the  ionic  displacement  of  con¬ 
fined  vibrations:  the  uncoupled  s-polarized  TO  mode  and  the  hybrid  of  the  LO  and 
p-polarized  TO  modes.  The  guided  mode  is  obtained  using  the  model  of  a  quantum  well 
with  infinitely  rigid  barriers,  and  the  interface  mode  is  derived  from  the  hydrodynamic 
boundary  conditions.  While  the  total  intersubband  scattering  rates  are  reduced  as  a 
result  of  confinement,  the  opposite  is  found  for  the  intrasubband  scattering.  Depending 
on  the  well  width  and  Ge  content,  the  intersubband  scattering  rates  are  reduced  by  a 
factor  of  two  to  four  with  respect  to  their  values  for  no  confinement.  Thus,  one  would 
expect  comparable  enhancement  in  the  intersubband  lifetimes  crucial  to  the  population 
inversion  in  a  Sii_rGex/Si  intersubband  laser. 
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HEAVY-HOLE  SCATTERING  BY  CONFINED 
NONPOLAR  OPTICAL  PHONONS 
IN  A  SINGLE  Si^GeJSi  QUANTUM  WELL 

Gang  Sun 

I.  INTRODUCTION 

The  possibility  of  lasing  due  to  intersubband  transitions  in  quantum  well  (QW) 
structures  is  crucially  dependent  on  the  lifetimes  of  the  involved  subbands.  In  an  ear¬ 
lier  communication1  lifetimes  were  calculated  for  a  Sii-^Ge^/Si  multiple  quantum  well 
(MQW)  structure  due  to  acoustic  and  nonpolar  optical  phonon  scattering.  It  was  pointed 
out  that,  because  of  the  absence  of  polar  optical  scattering  for  silicon  based  systems, 
the  lifetimes  and  their  differences  are  consistently  an  order  of  magnitude  larger  than 
for  the  GaAs/AlGaAs  system,  and  do  not  show  the  marked  decrease  when  the  inter¬ 
subband  energy  exceeds  the  optical  phonon  energy.  For  this  case,  the  scattering  was 
calculated  assuming  bulk  propagating  phonons  corresponding  to  the  average  composi¬ 
tion.  As  pointed  out  in  that  paper,  it  is  expected  that  the  treatment  of  bulk-like  phonon 
scattering  leads  to  an  overestimate  of  the  intersubband  transition  rates,  and  therefore 
provides  a  conservative  approach  in  determining  the  subband  lifetimes.  In  order  to 
more  accurately  determine  the  subband  lifetimes,  it  is  necessary  to  take  into  account 
of  confinement  effects  on  optical  phonons  in  heterostructures.  In  a  Sii-^Ge^/Si  QW 
structure,  one  can  expect  Si-Si,  Ge-Si,  and  Ge-Ge  optical  modes  in  the  alloy.2  The  Ge-Si 
and  Ge-Ge  modes  tend  to  be  confined  by  the  Si  barriers  as  has  been  seen  in  Raman  scat¬ 
tering  experiments.3  The  phonon  confinement  gives  rise  to  guided  and  interface  modes 
which  will  scatter  the  carrier  less  than  will  bulk  phonons  for  intersubband  transitions, 
thus  resulting  in  revised  subband  lifetimes.  This  is  due  to  the  discrete  spectrum  of  the 
guided  modes  and  the  weak  intersubband  scattering  of  interface  modes.  However,  the 
intrasubband  scattering  process  tends  to  be  enhanced  due  to  confinement. 

There  has  been  a  great  deal  of  effort  in  dealing  with  the  issue  of  phonon  con¬ 
finement  in  heterostructures.2’4-15  Most  of  published  literature  have  focused  on  confined 
polar  optical  phonons,  because  a  large  body  of  the  heterostructures  are  constructed  from 
polar  semiconductor  materials.  Recently,  the  demonstration  of  an  infrared  intersubband 
transition  laser  in  the  InGaAs/AlGaAs  material  system16  has  renewed  the  interest  of  in¬ 
vestigating  the  possibility  of  constructing  lasers  within  the  Ge/Si  material  system,  which 
eventually  would  allow  monolithic  integration  of  optical  components  with  advanced  Si 
microelectronics.17 

A  self-consistent  continuum  theory  of  confined  nonpolar  optical  phonons  has  been 
developed  in  an  infinite  plate  with  free  boundary  conditions.4  In  the  present  paper,  we 
extend  that  continuum  theory  to  examine  the  confined  nonpolar  optical  phonons  of  Ge-Si 
and  Ge-Ge  vibration  modes  with  proper  mechanic  boundary  conditions  in  a  Sii-^Ge^/Si 
QW.  Furthermore,  we  use  these  results  to  estimate  the  heavy-hole  scattering  rates  by  the 
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nonpolar  optical  phonons.  To  the  best  of  our  knowledge,  there  has  not  been  any  work 
in  estimating  the  scattering  rates  due  to  the  nonpolar  optical  phonons  in  Sii-^Ge^/Si 
heterostructures  taking  into  account  the  phonon  confinement. 

The  three  different  vibration  modes,  namely,  Si-Si,  Ge-Si,  and  Ge-Ge,  are  con¬ 
sidered  separately  and  are  given  proper  weights  in  heavy-hole  scattering  calculation. 
The  Si-Si  mode  propagates  freely  throughout  the  structure  and  is  therefore  treated  as 
bulk-like,  while  the  Ge-Si  and  Ge-Ge  modes  are  confined  by  the  barriers,  resulting  in 
guided  and  interface  modes.  Approximations  made  in  the  course  of  the  calculations  for 
the  guided  and  interface  modes  will  be  discussed.  We  will  examine  both  intrasubband 
and  intersuband  scattering  processes.  We  will  also  compare  these  results  with  those 
assuming  bulk  propagating  phonons.  We  will  show  that  indeed  the  assumption  of  bulk¬ 
like  optical  phonons  overestimates  the  scattering  rates  for  intersubband  transitions,  but 
tends  to  underestimate  the  intrasubband  process.  The  difference  is  largely  attributed  to 
the  heavy-hole  interaction  with  interface  modes  which  can  be  neglected  for  the  intersub¬ 
band  process,  but  contributes  significantly  to  the  intrasubband  process.  The  scattering 
rates  due  to  various  scattering  modes  will  be  investigated  as  a  function  of  the  well  width 
and  as  a  function  of  the  alloy  composition  of  the  well. 

II.  PHONON  CONFINEMENT 

Phonon  confinement  occurs  due  to  the  lack  of  overlap  of  the  bulk  frequency 
dispersions  in  the  adjacent  heterostructure  materials.  Thus  in  a  Si/Ge/Si  short  period 
superlattice,  the  higher  frequency  optical  modes  of  Si  correspond  to  an  frequency  gap 
in  the  Ge  layer  and  are  therefore  confined.  The  Ge  modes  in  the  Ge  layers  are  strictly 
resonant  with  the  Si  acoustic  modes  and  are  ‘quasi-confined’,  but  the  displacement 
pattern  is  similar  to  that  of  a  true  confined  mode  and  indeed  the  Raman  intensities  are 
comparable.2.  The  Raman  spectrum  has  revealed  that  there  exist  Ge-Ge,  Ge-Si,  and 
Si-Si  vibration  modes  in  the  Sii_rGex  alloy  layer.3  The  Ge-Ge  vibrations  have  phonon 
energy  ZlAmtV  and  the  Ge-Si  vibrations  50.8 meV.  These  two  modes  of  the  Sii_xGex 
well  are  confined  by  the  Si-Si  modes  of  the  barrier  (photon  energy  64.3meE);  however, 
the  Si-Si  optical  modes  of  the  well  are  unconfined,  as  are  the  Si-Si  optical  modes  of  the 
barrier. 

One  of  the  simplest  conceptual  model  is  to  treat  the  QW  system  with  infinitely 
rigid  barrier.  The  boundary  condition  to  be  satisfied  is  the  vanishing  of  the  ionic  dis¬ 
placement  of  all  confined  vibration  modes.  This  is  an  assumption  of  strict  confinement, 
yielding  guided  modes  of  the  confined  phonons.  This  assumption  needs  to  be  relaxed 
in  order  to  admit  interface  modes.  As  pointed  out  in  the  continuum  theory,4  the  ionic 
displacement  of  confined  vibrations  has  two  components:  one  is  the  hybrid  of  the  LO 
and  p-polarized  TO  (p-TO)  modes,  and  other  is  the  uncoupled  s-polarized  TO  (s-TO) 
mode.  These  modes  are  defined  as  follows:  If  we  consider  a  (x,  z)  plane  containing  the 
normal  to  the  layers  and  the  phonon  wavevector  Q,  then 

Q  —  C[xex  +  Qzez  (1) 
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where  ex  and  e,  are  unit  vectors.  The  p-TO  mode  has  its  displacements  normal  to  Q 
and  in  the  plane,  while  the  s-TO  displacements  are  normal  to  Q  and  perpendicular  to 
the  plane  (||ey).  A  description  of  the  s-TO  mode  is 

uy  =  eiqxX{Aeiq*z  +  Be~iq*z),  (2) 

while  the  hybrid  of  the  LO  and  p-TO  modes  is  given  by 

ux  =  eiqxX[qx(CeiqLZ  +  De~iq LZ)  +  qT{Eeiq^  +  Fe^*)},  ,  , 

uz  =  eiqxX[qh(CetqLZ  -  De~iqLZ)  -  qx{EeiqTZ  -  Fe~tqTZ) ].  {  } 

The  2-components  of  the  LO  and  TO  wavevector  have  been  distinguished  by  qi  and  qx , 
respectively. 

The  above  choice  for  the  ionic  displacement  guarantees  that 

V  x  ul  =  0  and  V  •  ut  =  0,  (4) 

which  hold  for  isotropic  materials  (assumed  here),  allowing  u  to  be  decomposed  into  LO 
and  TO  components, 

u  =  uL  +  uT.  (5) 

Since  the  LO  and  TO  modes  must  have  the  same  frequency  to  be  effectively 
coupled,  they  have  to  satisfy,  according  to  the  bulk  LO  and  TO  dispersions, 

u)2  =  -  f32L{ql  +  q\)  =  Wq  —  Privl  +  (®) 

where  /?£,  and  fix  are  the  velocities  of  LO  and  TO  dispersions,  respectively. 

Guided  Modes 


In  the  case  of  strict  confinement,  the  boundary  condition  of  course  is  that  the 
displacements  u  vanish  at  the  boundaries  2  =  (0 ,L).  For  the  s-TO  mode,  this  leads  to 

uy  =  AeiqxX  s'm(qzz)i  with  qz  =  —  (7) 

where  n  =  1,2,---.  This  mode  does  not  mix  with  other  modes,  nor  does  it  give  rise  to 
the  interface  mode  discussed  later. 

For  the  coupled  LO  and  p-TO  modes,  applying  the  boundary  conditions  of  ux  =  0 
and  uz  —  0  at  2  =  (0,  L)  to  Eq.(3)  leads  to  a  system  of  four  constant-coefficient  linear 
equations  for  C,  D ,  E,  and  F.  The  vanishing  of  the  determinant  of  the  4x4  constant 
coefficient  matrix  gives  rise  to  the  following  relation, 

( qrqL  +  ql)2  sin (qTL)  sin (qLL)  =  2qTqLq2x[cos(qT  -  qL)L  -  1].  (8) 


Eq.(8)  leads  to  solutions  of  guided  modes  consisting  of  coupled  phase-matched 
LO  and  TO  modes  with  wavevectors 


nh-K 

L 


and 


rix  7T 

IT’ 


(9) 
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where  iil  =  1,2,-  ■  - ,  nx  =  3,4,  -  -  - ,  and  nj  —  =  2,4,6,---.  This  choice  of  quantum 

numbers  is  due  to  the  constraint  Eq.(6)  since  f3j  <  @l- 


Two  sets  of  guided  mode  solutions  emerge  and  they  are  given  in  either  sine  or 
cosine  form  referring  to  the  2-component  of  the  ionic  displacement.  The  ‘sine’  solution 
is 


ux  —  2C  eiqxX  qx[cos{qLz)  -  cos(grz)], 

a2 

uz  -  2iC elkxX[qLs'm{qLz)  +  —  sin(qTz)], 

qT 


(10) 


and  a  ‘cosine’  solution 


^ X 


=  2iCetQxX[qxsm(qLz)  +  sin {qrz)}, 

(lx 

=  2C eiqxX qL[cos(qLz)  -  cos(gr2)]. 


(11) 


It  is  worthwhile  to  point  out  the  similarities  between  the  allowed  values  of  qz,  q^, 
qr  and  qx  for  the  guided  modes  in  the  case  of  infinitely  rigid  barriers  assumed  here  and 
that  ol  an  infinite  plate  with  free  boundary  conditions.4  But  the  displacement  patterns 
differ  dramatically  between  the  two  situations. 

Interface  Modes 


The  assumption  of  strict  confinement,  that  the  vibration  amplitudes  are  zero 
at  the  interfaces,  rules  out  the  possibility  of  interface  modes.  In  general  such  modes 
exist  and  are  of  increasing  importance  in  carrier  scattering  intrasubband  transitions  in 
comparison  with  guided  modes.  The  assumption  of  strict  confinement  must  therefore  be 
relaxed  to  admit  such  modes.  It  is  obvious  that  the  amplitudes  of  interface  modes  at  the 
boundaries  should  remain  small  in  order  to  keep  the  assumption  of  strict  confinement 
yielding  the  guided  modes.  In  fact,  we  need  only  to  relax  the  vanishing  assumption  of 
the  .r-component  of  the  displacement  to  admit  interface  modes  while  maintaining  the 
zero  boundary  condition  on  the  2-component.  A  similar  boundary  condition  has  been 
employed  in  treating  the  polar  optical  phonons  for  a  single  GaAs/AlAs  QW.10 

Applying  the  hydrodynamic  boundary  conditions5  at  the  interfaces  to  the  LO  and 
TO  modes  independently  and  considering  the  fact  that  the  optical  phonon  frequency  in 
the  barrier  region  is  greater  than  that  in  the  well  region,  we  obtain  only  a  solution  with 
even  2- component  for  the  LO  mode, 


ul 


CetqxX(qxex  -  qL2ez)e  tqL2Z  z<  0, 

<  CetqxX(—)1/2(  ——qxex  -  qL2ez )  0  <  z  <  L, 

Pi  Vi  P l 

—CeiqxX(qxex  +  qL2ez)eiq z  >  L, 


(12) 


where  the  subscripts  1  and  2  refer  to  well  (Sij-^Gej;)  and  barrier  (Si)  regions,  respectively, 
Pi  is  the  material  density,  and 

r]t  =  u2/u2  -  1,  (13) 
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where  cu,  is  the  T-point  optical  phonon  frequency  in  layer  i  (=  1,2).  For  both  Ge-Si  and 
Ge-Ge  conhned  modes  we  have  lo 2  >  u)\.  The  LO  wavevector  in  the  well  region  qn  =  0 
and  that  in  the  barrier  region, 


<?L2 


LJcf  —  LO-\ 


(14) 


The  LO  mode  solution  is  of  two-dimensional  bulk  type  with  constant  amplitudes  in  the 
well  region  propagating  with  wavevectors  parallel  to  the  interfaces. 


Both  odd  and  even  solutions  emerge  for  the  TO  mode,  but  the  boundary  condition 
that  the  z-component  of  u^  +  Ut  vanishes  at  the  interfaces  admits  only  the  even  solution, 


ur 


'  De^(qT2ex  +  qxez)e-'^z 

ne‘ixx(P iu/2 _ I _ 

<  p2  cos(qTiL/2) 

x  [-iqn  sin  qT\{z  -  L/2)ex  +  qx  cos  qTi(z  -  L/2)ez\ 
.  —  DeiqxX(qT2ex  -  qxez)eiq^-L) 


z  <  0, 

(15) 

0  <  z  <  L, 
z  >  L, 


where  the  wavevectors  for  the  TO  mode,  according  to  Eq.(6) 

0ri  =  (§r 1“1)9x  (well)> 

02  (16) 

?T2  +  (barrier). 

Pt  2  Fr2 


Combining  Eqs.(12)  and  (15)  and  applying  that  the  z-component  of  u l  +  Ur  vanishes 
at  the  interfaces,  lead  to  D  =  (qL2/qx)C,  and  finally  the  interface  mode  is  obtained, 


CeiqxX[{qxe-iq^z  + 


(lL2qT2  ! 


iqr?z 


qx 


)ex 


U  =  < 


+{-qL2C  lqL2Z  +  qL2&  i?T22)e2] 
Pl_\l/2  IT7?2  P2  -  i  <!L2qTl 

P2 


CeiqxX(— y/2  qx 


hi  Pi  cos(qT1L/2)  qx 
rcos  qTi{z  —  L/2)  1]  A1 

+qL2[ — 7777:  FTon 


sin  <7x1(2  -  T/2)]e2 


cos(gnT/2) 


z<0, 


0  <  z  <  T, 


CeiqxX  { [— <7xeI9i2(2-L)  - 


<7L2<7X2  ,■ 


*<7T2 


F-G]f 


+  [-?L2ei?i2(2"L)  +  <?L2ei<7T2(2"L)]e,} 


z  >  L. 


(17) 


The  dispersion  of  bulk  Si-Si  mode  has  little  overlap  with  that  of  Ge-Si  mode. 
The  scattering  process  involves  only  long  wavelength  in-plane  wavevectors,  and  the  z- 
components  of  phonon  wavevectors  in  the  barriers,  qL2  and  qx 2,  are  approximately  the 
dimension  of  the  Brillouin  zone.  This  is  because  the  dispersion  relations  overlap  only  at 
large  phonon  wavevectors  in  the  barrier  region.  Since  qn  =  0  and  qx  ~  0,  the  interface 
mode  frequency  u  ~  therefore,  772/hi  >>  1-  The  major  scattering  contribution  from 
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the  interface  mode  is  from  the  displacement  in  the  well  region  where  the  heavy-holes 
are  confined.  The  amplitude  of  the  LO-component  is  much  greater  than  that  of  the 
TO-component  in  Eq.(17)  for  0  <  z  <  L,  since 


^2  P2_  ^  QL2(/T1 

m  Pi  '  <ll 


(18) 


considering  q-pi  also  small.  Therefore,  the  exact  values  of  qi 2  and  qj2  are  not  crucial 
in  determining  the  scattering  rates  because  the  TO  mode  contribution  is  at  least  two 
orders  of  magnitude  less  than  the  LO  mode.  The  above  displacement  pattern  Eq.(17) 
has  been  used  in  evaluating  the  interface  mode  scattering  for  the  confined  Ge-Si  and 
Ge-Ge  modes. 


III.  HEAVY  HOLE  SCATTERING  RATES 


The  nonpolar  optical  phonon  interaction  Hamiltonian  involving  a  heavy  hole  is18 


- D  ■  u 

Mx  +  M2 


(19) 


where  M\  and  M2  are  the  masses  of  the  two  atoms  in  the  unit  cell,  and  D  is  the  optical 
deformation  potential. 


The  normalization  of  the  displacement  amplitudes  is  carried  out  through  the 
approach  ot  equating  the  energy  of  the  vibration  mode  with  that  of  a  simple  harmonic 
oscillator9  as 

xi=you''udz -  <2o) 

where  L  is  the  well  width,  5  is  the  sample  surface  area  (in  ( x,y )  plane),  fl  is  the  volume 
of  the  unit  cell,  and  \  is  the  normal  coordinator  of  the  oscillator. 


The  heavy-hole  band  offset  is  calculated  taking  into  account  the  compressive 
strain  in  the  Sii_xGeT  well  region.19  The  heavy-hole  bands  are  decoupled  from  the  light- 
hole  and  split-off  bands  at  k  =  0  and  can  be  treated  independently.20  The  heavy-hole 
energy  levels  and  envelope  wavefunctions  are  obtained  by  the  finite  square  well  model  as 
shown  in  Fig.  1 .  The  heavy-hole  state  can  be  characterized  by  |k,  n  >  with  the  in-plane 
momentum  k  and  subband  index  n.  In  the  approximation  of  constant  effective  mass  for 
heavy  holes,  the  matrix  element  for  the  transition  from  state  |k,  n  >  to  |k',  n!  >  due  to 
nonpolar  optical  phonon  scattering  is 


<  k',  n'\H\\t ,  n  >=  < 


jh[n{u0)  +  1/2  1/2]  y 

2PlL0oSLAA(qz)  4'±qX’k  Dy°nn'(9z)  (S’T0) 
/%( Up)  +  1/2  tT/2]~ 

2piu>0SLAc(qL,qT)  k±qx,k 

•[DxG%n,(qL,qT)  +  DzGznn,(qL,qT)  ]  (hybrid), 


(21) 


for  the  s-TO  mode  and  the  hybrid  of  the  LO  and  p-TO  mode,  respectively.  n(u)0 )  is 
the  number  of  optical  phonons  at  thermal  equilibrium,  and  the  upper  and  lower  signs 
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compressive 

Sii_xGex 


1  - 1  intrasubband 
scattering 


Figure  i:  Heavy  hole  band  structure  and  scattering  processes. 

refer  to  phonon  absorption  and  emission,  respectively.  The  three component  of  h 
optical  deformation  potential.  D„  D„  and  D,  are  assumed  eqoal  to ■  D< ,  -  DJ  V3  m  he 
calculation  in  view  of  the  assumption  of  isotropy.  The  kronecker  symbol  indicates 
in  plane  U.  y)  momentum  conservation.  The  normalisation  iactors  are  given  by 


A.4(q*)  =  jJQ  u‘yuydz  (s-TO), 

a  c(wt)  =  \  /oW+  u>’)dz  thybrid)' 


(22) 


The  Gnn, -functions  contain  envelope  wavefunctions,  by  and  ipn,  from  which  interfer 
effect  can  be  obtained.  Specifically, 

Gln’iqz)  =  l  Wntfn'Uydz.  (23) 


for  the  s-TO  mode,  and 


fL 

Gxnn\qL,qr)  =  /  'bni’n'Uxdz, 
L 

Gnn'(qLi  QT)  =  J0  i’n'^n'Uzdz, 


(24) 


for  the  hybrid  of  LO  and  p-TO  mode.  It  should  be  noted  that  given  the  pmper  di¬ 
placement  expressions,  Eq.(21)  for  the  matrix  element  is  valid  for  both  the  0 
interface  modes  discussed  above. 

Obviously,  depending  on  the  alloy  composition  in  the  well  the  three  modes  (SbSn 
Ge-Si,  and  Ge-Ge)  will  have  different  interaction  strengths  with  the  carriers  in  th  Q 
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structure.  Specifically,  each  interaction  should  be  weaker  than  that  for  the  case  where 
it  is  the  only  existing  mode  and  therefore  needs  to  be  accounted  for  properly.  A  crude 
model  to  approximate  the  relative  strength  of  each  individual  mode  is  to  assign  a  proper 
weight  in  the  calculation  of  the  phonon  scattering  rate.  If  we  assume  bonds  formed 
between  Si-Si,  Ge-Si,  and  Ge-Ge  are  purely  random,  then  we  can  give  the  following 
weights  according  to  the  Ge  content,  x,  to  each  of  the  vibration  modes  in  the  Sii-^Ge^ 
well:  wsi  =  (1  —  x )2  for  Si-Si  mode,  WGeSi  —  2x(l  —  x)  for  Ge-Si  mode,  and  woe  =  x 2 
for  Ge-Ge  mode. 


Applying  the  Fermi  golden  rule,  we  obtain  the  scattering  rate  due  to  the  guided 

modes. 


W: 


wjmXh[n(«,0)  +  l/2Tl/2]D*  \Gynn,  |2 

2h2Pluj0L  ^ 92  Aa 

wjmhhin(uo)  +  1/2  1/2]  .Dp  |  Gxnn,  +  Gznn,  | 2 

'2h2  f)\uo(,L  Ul'qT  Ac 


(s-TO) 

(hybrid), 


(25) 


where  we  have  assumed  that  for  the  intersubband  process  (n  /  n')  the  heavy  holes  are 
scattered  from  the  bottom  of  their  original  subbands,  and  for  the  intrasubband  process 
{n  =  /?')  the  heavy  holes  have  just  enough  kinetic  energy  (equal  to  the  Si-Si  optical 
phonon  energy)  to  emit  an  optical  phonon  to  reach  to  the  bottom  of  the  same  subband. 
Wj  ( j  =Ge-Si,  Ge-Ge)  is  the  weight  assigned  to  a  particular  vibration  mode  and  m*hh  is 
the  heavy-hole  effective  mass. 


For  the  s-TO  guided  mode,  the  summation  in  Eq.(25)  is  over  all  qz  =  mr / L 
limited  by 


where  a }  is  the  lattice  constant  and  qx  is  given  by  the  constraint  of  the  in-plane  momen¬ 
tum  conservation.  For  the  hybrid  guided  mode,  the  summation  is  restricted  to  those 
combinations  of  qi  =  niir/L  and  qj  =  / L  which,  according  to  Eq.(6),  yield  discrete 

values  of  qx  satisfying  the  in-plane  momentum  conservation.  It  should  be  pointed  out 
that  an  unique  final  state  heavy-hole  wavevector  k'  (initial  k  =  0)  will  result  if  the 
phonon  frequency  dispersion  is  neglected,  corresponding  to  a  constant  optical  phonon 
energy.  Then  the  in-plane  momentum  conservation  would  be  impossible  since  the  dis¬ 
crete  values  of  qx  will  not  be  able  to  exactly  match  the  unique  value  of  k'.  However, 
taking  into  account  of  the  phonon  dispersion,  a  region  of  heavy-hole  wavevector  k'  will 
be  obtained  and  the  scattering  process  is  permitted  as  long  as  qx  lies  in  that  region. 


There  is  no  s-TO  interface  mode,  and  the  scattering  rate  due  to  the  hybrid 
interface  mode  can  be  given,  similarly, 


WL 


wjrnhh[n{U}o)  T  1/2  jF  1/2 ]D20  .  2 

2h2Plio0LAc  1  nn  1171,1 


(27) 


where  G^n,  and  Gznn,  are  functions  of  qx i,  qL2,  and  qj 2  with  qn  =  0.  The  determination 
of  the  phonon  wavevectors  in  the  well  and  barrier  regions  have  been  discussed  in  section 
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fi5ure  [ntrasubband  scattering  ,,es  in  the  low,,  subb.nd  C  -  1)  d»  »  Ge-Si  and  Ga-G,  guided 
and  interface  modes  as  a  function  of  well  width  with  *  -  O.o. 

U.  Eqs-(25i  and  (27)  are  used  bo  evaluate r  both i  the 

Intrasubband  Scattering 

The  intrasubband  scattering  rate  was  calculated  fuming  that 
originated  from  the  heavy-hole  state ^ivith  z  kinetic  s  o{  phonon  emission 

to  the  bottom  OI  the  same  subband,  therefore,  on  v  temperature.  The 

is  considered  in  the  calculation.  Al ■  reau  s  are  o  at  ^  metastable  strained 
structure  parameters  are  varied  tv  thm  the  \  jJ^ubband  transition  rates  (1-1) 

Sit-Gee  alloy  on  the  S.  substrate.  F-g-2  ^owsjhe  rM  ^  ^  Ge_gi  and  Ge.Ge  ^ded 
within  the  ground-state  heavy-hole  subba  (  )  be  seen 

and  interface  modes  as  a  function  oi  the  well width  for  a is  about  same 
that  for  the  intrasubband  process  the  strength  o  in  e  T)u.  is  in  COntrast  with  the 

order  of  magnitude  as  that  °*  “"eSp“bf*  ring  rates  of  both  guided  and  interface 
mtersubband  process  discussed  la^er.  increase  of  the  well  width  when 

modes  ton  Ge-Si  and  Ge-Ge  vibratrons  increases  with  .he  G.functim  OTth 

t  t  ^  ok  a\  Thic:  i=  Hue  to  the  increase  ot  tne  lnterierenuc  vj  ^ 

n  “  ^r'ven  m  **  <*)  «  ‘""r  tt  "n“ 

^^“SrwStSesIrther  since  the  envelope  functron  has 
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mostly  been  confined  within  well  region.  Examining  the  interface  scattering  rate  given 
in  Eq.(27),  one  would  expect  it  to  decrease  with  the  further  increase  of  the  well  width, 
which  is  clearly  demonstrated  in  Fig. 2  for  both  Ge-Si  and  Ge-Ge  phonon  interactions. 
A  similar  decrease  of  the  interface  scattering  rate  in  a  GaAs/AEGa^^As  QW  has  been 
reported.' 

For  the  guided  mode,  the  scattering  rate  is  given  by  summing  the  contributions 
from  the  s-TO  and  hybrid  of  LO  and  p-TO  modes.  However,  the  hybrid  interaction 
with  the  heavy  holes  is  much  weaker  than  the  s-TO  mode,  because  of  the  requirements 
on  the  hybrid  mode  to  simultaneously  satisfy  the  in-plane  momentum  conservation  and 
the  frequency  dispersion  Eq.(6).  This  excludes  the  interaction  of  most  hybrid  modes 
with  the  heavy  holes,  leaving  the  s-TO  modes  as  the  dominant  scattering  mechanism. 
In  fact,  the  number  of  allowed  s-TO  vibration  modes  will  increase  with  the  increase  of 
the  well  width.  It  is  easy  to  see  from  Eq.(23)  that  for  the  intrasubband  process  (n  =  n') 
the  interference  G-function  is  nonzero  only  for  even  s-TO  modes  {nq_  =  1,3,5,---), 
and  the  effective  contribution  is  from  the  lowest  order  mode  nqz  =  1  with  higher  order 
modes  being  at  least  two  orders  of  magnitude  less.  The  guided  mode  scattering  for  the 
intrasubband  process  is  therefore  practically  a  single  s-TO  mode  interaction  and  behaves 
similarly  to  the  interface  mode.  The  difference  between  the  Ge-Si  and  Ge-Ge  vibrations 
is  mainly  due  to  the  different  weights  assigned  to  them  according  to  the  Ge  content  in 
the  SixGei_x  alloy.  Our  calculation  also  showed  that  the  (2-2)  intrasubband  scattering 
process  has  very  similar  behavior  with  the  interface  mode  scattering  slightly  higher  than 
the  guided  mode,  but  overall  smaller  rates  compared  to  the  (1-1)  process. 

Fig. 3(a)  shows  the  total  scattering  rate  as  a  function  of  the  well  width,  which  is 
given  by  summing  the  contributions  from  the  Ge-Si,  Ge-Ge,  and  Si-Si  (treated  as  bulk¬ 
like)  vibrations.  Also  shown  in  Fig. 3(a)  is  the  result  calculated  neglecting  the  phonon 
confinement  and  treating  all  vibration  modes  as  bulk-like  for  comparison.  It  can  be  seen 
that  the  phonon  confinement  actually  enhances  the  intrasubband  scattering  process. 
However,  the  opposite  is  true  for  the  intersubband  process  to  be  shown  below.  Fig. 3(b) 
shows  the  same  result  as  a  function  of  the  Ge  content  in  the  alloy  for  the  well  width  of 
40 A.  The  difference  increases  with  the  increase  of  the  Ge  content  since  the  contributions 
from  the  confined  phonons  will  increase  while  that  from  the  unconfined  Si-Si  vibration 
decreases. 

Intersubband  Scattering 

The  intersubband  scattering  rate  was  calculated  assuming  that  the  transition 
originated  from  the  bottom  of  a  subband  n  with  zero  kinetic  energy  to  another  sub¬ 
band  (n'  7^  n).  Both  phonon  absorption  and  emission  processes  are  considered  in  the 
calculation.  Fig. 4(a)  shows  the  (2-1)  scattering  rate  with  the  Si-Si,  Ge-Si,  and  Ge-Ge 
components  as  a  function  of  well  width  for  a  Sio.sGeo.s/Si  QW.  Subband  2  appears  at 
the  well  width  of  20A.  The  total  scattering  is  obtained  by  summing  the  contributions 
derived  from  the  Ge-Si,  Ge-Ge,  and  Si-Si  vibrations  including  both  guided  and  interface 
modes.  But  the  interface  mode  scattering  is  at  least  two  orders  of  magnitude  weaker  than 
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well  width  (A) 


(a) 


Figure  fc  Total  intrasubband  scattering  rates  in  the  lowest  subband  (n  =  1)  due  to  Ge-Si, 
fined  modes  and  Si-Si  unconfined  modes,  compared  to  the  bulk  intrasubband 

no  confinement,  /a)  as  a  function  of  well  width  with  Ge  content  ,  =  0.5;  (b)  as  a  function  of  *  for 
well  width  of  40A. 
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Figure  4:  Total 
modes  and  Si-Si 
confinement,  (a) 

width  of  40A. 


intersubband  scattering  rates  from  subband  2  to  1  due  to  Ge-Si.  Ge-Ge  confine 
unconfined  modes,  compared  to  the  bulk  intersubband  scattering  rate  assuming  no 
as  a  function  of  well  width  with  Ge  content  x  =  0.5;  (b)  as  a  function  of  x  for  a  well 


the  guided  mode  for  the  intersubband  process.  Similar  weakness  ot  the  interface  mode 
has  been  shown  in  a  GaAs/AlrGai_rAs  QW.r  The  result  assuming  bulk-like  phonons  is 
also  shown  in  Fig.4(a)  for  comparison.  The  sharp  increase  of  the  scattering  rates  in  the 
small  well  width  region  (L  <  25.4)  is  once  again  due  to  the  increasing  confinement  m  the 
well  region  of  subband  2  which  leads  to  an  increase  of  the  interference  G-function.  Both 
scattering  intensities  from  Si-Si  and  bulk-like  phonons  reduces  with  further  increase  of 
the  well  width.  Since  initially  the  narrow  well  width  only  allows  a  small  number  o 
o-uided  modes,  the  guided  mode  scattering  is  weak  compared  to  the  Si-Si  scattering  an 
the  total  scattering  rate  follows  the  dependence  of  Si-Si  vibration. .  As  the  number  o 
allowed  guided  modes  increases  with  increasing  well  width,  the  guided  mode  scatter¬ 
ing  actually  surpasses  the  Si-Si  component,  leading  to  an  increase  of  the  total  scattering 
rate.  The  small  discontinuous  incremental  steps  in  the  Ge-Si,  Ge-Ge,  and  therefore  total 
scattering  curves,  are  due  to  the  discrete  nature  of  the  increase  in  the  number  of  allowed 
guided  modes  as  the  well  width  increases.  The  sudden  drops  in  the  curves  occur  when 
the  energy  separation  between  the  two  subbands  reduces  to  less  than  one  of  the  three 
phonon  energies  corresponding  to  different  vibrations.  Specifically,  the  drops  occurre 
at  L  =  72,  83,  99A  correspond  to  Si-Si  phonon  ( SASmeV ),  Ge-Si  phonon  (o0.8 meV), 
and  Ge-Ge  phonon  (37.4me7),  respectively.  It  can  be  seen  in  Fig.4(a)  that  t lie  phonon 
confinement  reduces  the  scattering  rate  by  a  factor  of  two  to  four  as  compared  to  the 
bulk-like  phonons.  It  is  therefore  important  to  take  into  account  the  phonon  confinement 
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effect  in  estimating  the  subband  lifetimes. 

Fig. 4(b)  shows  the  (2-1)  scattering  rates  as  a  function  of  Ge  content  in  the 
Sii_xGer  alloy  for  a  well  thickness  of  40A.  The  contributions  from  the  guided  modes 
increases  with  the  Ge  content  while  the  unconfined  Si-Si  component  reduces.  As  a  result, 
the  difference  between  the  total  scattering  rate  and  the  bulk-like  rate  widens  with  Ge 
content. 


IV.  SUMMARY  AND  DISCUSSION 

In  an  earlier  paper  we  treated  phonon-induced  intersubband  transitions  in  Sii-^Ge^/Si 
MQW  structures.1  The  absence  of  polar  optical  scattering  resulted  in  subband  lifetimes 
an  order  of  magnitude  larger  than  those  in  MQWs  of  III- V  semiconductors  and  without 
the  precipitous  decrease  at  the  optical  phonon  threshold.  This  earlier  work  assumed 
bulk-like,  three  dimensional  phonons.  Noting  strong  experimental  evidence  for  phonon 
confinement  in  Sii-xGe^/Si  MQWs,  the  present  paper  extends  the  earlier  treatment  to 
the  case  in  which  phonons  are  confined. 

A  single  Sii_xGex  QW  with  Si  barrier  is  considered.  The  decoupled  heavy-hole 
subbands  and  wavefunctions  are  determined  by  the  finite  square  well  model.  Assigning 
statistical  weights  to  the  Si-Si,  Ge-Si,  and  Ge-Ge  vibrations  in  the  QW  assuming  a  purely 
random  bond  model,  the  Ge-Si  and  Ge-Ge  modes  are  treated  as  confined  while  the  Si-Si 
modes  are  extended.  The  confined  modes  are  of  two  kinds,  guided  modes  and  interface 
modes.  The  guided  modes  consist  of  the  s-TO  and  coupled  LO  and  p-TO  modes,  with 
the  boundary  condition  that  the  displacements  vanish  at  the  interfaces.  The  interface 
mode  is  obtained  applying  hydrodynamic  boundary  conditions  to  the  LO  and  TO  modes 
separately  and  then  requiring  that  the  ^-component  of  their  sum  (not  the  ^-component) 
vanish  at  the  interfaces.  The  LO  mode  is  a  two-dimensional  bulk-type  solution  with 
constant  amplitude  in  the  well  and  propagating  parallel  to  the  interface;  the  TO  mode 
is  an  even  function  (referring  co  the  ^-component)  with  a  sinusoidal  spatial  dependence 
in  the  growth  (~)  direction  so  that  the  boundary  conditions  at  the  interfaces  can  be 
satisfied. 

The  Ge-Ge  optical  phonon  spectrum  in  the  well  region  has  no  overlap  with  that 
of  Si-Si  optical  phonons  in  the  barriers,  but  instead  overlaps  with  that  of  the  acoustic  Si 
phonons.  The  model  for  the  interface  mode  derived  from  the  optical  phonon  dispersion 
relations  using  the  hydrodynamic  boundary  conditions  is  not  without  criticism.  Further 
investigation  of  this  subject  is  underway  to  provide  a  more  accurate  estimate  of  the 
intrasubband  scattering  rates.  But  we  would  still  expect  the  intersubband  process  to 
be  dominated  by  the  guided-mode  scattering,  which  ultimately  determines  the  subband 
lifetimes. 

The  intrasubband  and  intersubband  scattering  rates  are  calculated  as  a  function 
of  the  QW  structure  parameters:  Ge  content,  x.,  in  the  Sii_xGex  well  and  the  well 
width.  L.  These  parameters  are  varied  within  the  limits  reportedly  to  produce  the 
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metastable  strained  layers  of  Sii-^Ge^  alloy  on  Si  substrates  without  introducing  a  large 
number  of  misfit  defects.  We  choose  the  Ge  content  of  x  =  0.5  as  we  vary  the  well 
width  to  give  a  fair  weight  of  the  Ge-Si  and  Ge-Ge  confined  modes.  We  use  the  fixed 
well  width  of  40 A  to  allow  at  least  two  confined  heavy-hole  subbands  as  we  vary  the 
Ge  content.  Our  results  are  as  follows.  For  intrasubband  scattering,  the  scattering  by 
the  interface  mode  is  nearly  the  same  as  that  for  the  guided  mode  (Fig. 2),  and  the 
total  intrasubband  scattering  with  confinement  is  larger  than  that  without  confinement 
(Fig.  3).  However  for  intersubband  scattering,  the  interface  mode  scattering  is  at  least 
two  orders  of  magnitude  smaller  than  that  for  the  guided  mode.  Finally,  as  can  be 
seen  from  Fig. 4,  the  total  intersubband  scattering  rate  with  confinement  is  a  factor  of 
two  to  four  smaller  than  that  which  would  be  obtained  assuming  bulk  phonons  (i.e. 
no  confinement).  Thus,  one  would  expect  a  factor  of  two  to  four  enhancement  in  the 
intersubband  lifetimes  which  determine  population  inversion  in  an  intersubband  laser. 
Since  phonon  confinement  is  well  documented  for  the  GaAs/AlAs  system,23  lifetime 
enhancements  should  also  be  present  here,  though  the  scattering  due  to  the  polar  modes 
is  stronger  and  where  the  splitting  of  the  LO  and  TO  modes  will  require  a  treatment 
different  from  the  present  nonpolar  case. 
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Abstract 

A  hybrid  numerical  technique  combining  the  moment  method  and  the  geometrical 
theory  of  diffraction  has  been  extended  to  allow  the  calculation  of  electromagnetic  scat¬ 
ter  from  lossy  dielectric  surfaces.  The  hybrid  technique  eliminates  the  non-physical 
edge  effects  that  are  introduced  in  standard  moment  method  implementations,  thereby 
allowing  the  application  at  extreme  grazing  angles.  The  dielectric  surface  is  represented 
using  impedance  boundary  conditions.  Sample  calculations  demonstrate  the  reduction 
in  scattering  from  a  rounded-apex  wedge  when  the  surface  conductivity  is  reduced.  The 
technique  should  allow  more  realistic  calculation  of  the  scattering  from  land  and  water 
surfaces  than  can  be  obtained  using  a  perfectly  conducting  surface. 
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I.  INTRODUCTION 


Traditional  implementations  of  the  moment  method  for  calculating  the  electromagnetic  scat¬ 
tering  from  rough  surfaces  have  been  limited  to  application  at  moderate  to  large  illumination 
grazing  angles  [1,  2].  Finite  computer  resources  limit  the  length  of  the  surface  that  can  be  nu¬ 
merically  modeled,  introducing  non-physical  edges  in  the  scattering  surface  that  can  lead  to 
unrealistic  diffractive  scattering.  These  “edge-effects”  are  often  avoided  using  an  illumination 
weighting  function  that  reduces  the  incident  field  to  negligible  levels  at  the  edges.  Thorsos 
[3]  showed  that  electromagnetically  valid  weighting  functions  can  yield  unrealistic  surface 
illumination  if  the  modeled  surface  is  insufficiently  large.  Unfortunately,  the  required  sur¬ 
face  length  increases  dramatically  with  decreasing  grazing  angle  (increasing  incidence  angle), 
limiting  the  smallest  grazing  angle  at  which  the  technique  can  be  applied. 

West  [4]  implemented  a  hybrid  numerical  technique  combining  the  moment  method  (MM) 
and  the  geometrical  theory  of  diffraction  (GTD)  that  overcomes  many  of  the  limitations  of 
the  traditional  moment  method  when  calculating  the  scattering  from  perfectly  conducting 
surfaces.  In  this  approach  the  surface  is  extended  to  infinity,  thereby  eliminating  the  artificial 
edges.  The  technique  has  been  used  to  investigate  the  effects  of  surface  self-shadowing  on 
the  backscattering  from  perfectly  conducting  surfaces  approximating  rough  ocean  waves  [4] 
and  the  effects  of  multi-path  on  backscattering  from  breaking  ocean  waves  [5].  This  hybrid 
technique  has  been  enhanced  to  allow  the  application  to  lossy  dielectric  surfaces,  thereby 
allowing  accurate  prediction  of  the  scattering  from  rough  land  surfaces  as  well  as  sea  surfaces. 
A  detailed  description  of  the  enhanced  technique  is  given  below,  as  well  as  sample  calculations 
of  the  scattering  from  lossy  dielectric  objects. 
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II.  OVERVIEW  OF  TECHNIQUE 

A.  Perfecthj  Conducting  Surfaces 

Application  of  the  hybrid  MM/GTD  numerical  technique  to  scattering  from  perfectly  conduct¬ 
ing  surface  is  described  in  [4].  A  detailed  review  is  given  here.  Adapted  from  the  technique 
described  by  Burnside  et  a/. [6],  it  is  quite  similar  to  the  standard  moment  method  in  that  scat¬ 
tering  from  the  surface  is  found  by  first  numerically  solving  an  integro-differential  equation 
to  yield  the  surface  current.  In  both  methods,  the  unknown  surface  current  is  represented  as 
a  summation  of  known  basis  functions.  The  weighting  coefficients  associated  with  each  basis 
function  that  give  the  “best”  approximate  solution  are  obtained  using  the  moment  method. 
The  primary  difference  between  the  two  techniques  is  that  the  hybrid  approach  uses  a  priori 
knowledge  of  the  current  obtained  from  GTD  to  define  well  behaved  basis  functions  and 
additional  source  terms  that  allow  the  treatment  of  special  infinitely  long  surfaces,  thereby 
avoiding  the  artificial  edge  effects  introduced  in  the  standard  MM.  The  surface  current  is 
then  radiated  to  yield  the  scattered  field. 

For  the  one-dimensionally  rough  surfaces  considered  here,  vertically  polarized  scattering 
is  best  described  by  the  magnetic  field  integral  equation  (MFfE)  [7]: 

//'(()  =  0.5 J,(l)  +  j f  /  Ml')  («'  •  p)  h!2103| p  -p'\)  dl' 

=  (1) 

where  /  is  the  arc  length  along  the  scattering  surface,  Hl(l)  is  the  incident  magnetic  field  at 
the  scattering  surface,  JS{1)  is  the  unknown  surface  current  to  be  found,  j3  is  the  free  space 
wave  number,  p  is  the  position  vector  of  the  observation  point,  p'  is  the  position  vector  of 
the  source  point,  n'  is  the  normal  unit  vector  at  the.  source  point,  and  is  the  first-order 
Hankel  function  of  the  second  type.  The  integration  is  the  principal  value  integral  (avoiding 
the  singularity  where  /  =  /')  over  the  entire  surface.  Horizontally  polarized  scattering  is  more 
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Figure  1:  Arbitrary  scattering  surface. 


easily  treated  by  the  electric  field  integral  equation  (EFIE): 


£■(/)  =  ^  j 

=  Le[UI)\, 


(2) 


where  7]0  is  the  intrinsic  wave  impedance  of  free  space.  The  MFIE  and  EFIE  can  be  written 
in  the  single  notation 

F'(l)  =  Lx[J,(l)],  (3) 

where  F  is  either  E  or  H  and  X  is  either  E  or  M.  In  the  standard  moment  method,  the 
infinite  integrations  in  equations  (1)  and  (2)  are  truncated  to  be  over  a  finite  surface  arc 
length.  The  current  on  the  modeled  length  L  is  then  divided  into  a  weighted  summation 
of  adjacent  pulse  basis  functions,  and  the  moment  method  is  used  to  find  the  associated 
weighting  coefficients.  It  is  the  truncation  of  the  integrations  that  lead  to  the  non-physical 
edge  effects. 

The  hybrid  technique  is  applied  to  one-dimensionally  rough  surfaces  of  the  form  shown 
in  Figure  1.  The  dashed  section  of  the  surface  represents  the  actual  rough  surface  while  the 
solid  line  represents  infinitely  long,  planar  extensions.  The  extensions  are  chosen  such  that  all 


points  on  the  actual  surface  are  shadowed  from  all  points  on  the  extension  (except  of  course 
at  the  intersection  points  B  and  C).  Because  the  surface  is  arbitrary,  little  is  known  initially 
about  the  current  between  points  A  and  D.  Thus,  the  current  in  this  region  is  described  using 
standard  MM  pulse  basis  functions  with  impulse  testing  functions  (yielding  point  matching) 
centered  on  the  basis  functions. 

Since  the  extensions  are  shadowed  from  the  arbitrary  surface  points,  the  fields  at  the 
surface  of  the  extensions  can  be  entirely  described  as  the  sum  of  a  field  diffracted  from  point 
B  or  C  plus  the  geometrical  optical  (GO)  incident  and  reflected  fields: 

Fl  =  fi  +  Fs  =  pGO  +  Fd ^  (4) 

where  F1  is  the  total  field,  Fl  is  the  incident  field,  Fs  is  the  scattered  field,  FG0  is  the 
gemetrical  optics  incident  and  reflected  fields,  and  Fd  is  the  diffracted  field.  The  current 
on  the  extension  is  obtained  by  applying  the  surface  boundary  conditions  to  equation  (4), 
yielding  the  physical  optics  current  associated  with  the  GO  fields  plus  an  additional  current 
component  associated  with  the  diffracted  field  (the  “diffraction-field  current”): 

Js  —  J po  "T  Jd'  (5) 

Since  the  extension  is  flat  and  perfectly  conducting,  the  PO  current  is  known  exactly  a  priori. 
(Note  that  if  the  extension  is  shadowed  from  the  incident  field  the  PO  current  is  simply  zero). 
However,  the  diffracted  field,  and  therefore  the  diffraction-field  current,  is  not  known  initially 
and  must  be  determined  using  the  moment  method.  Since  it  extends  to  infinity,  use  of 
ordinary  sub-domain  MM  basis  functions  to  describe  this  current  would  lead  to  an  infinite 
order  system  of  linear  equations  that  cannot  be  solved.  Instead  it  is  recognized  that  at 
distances  far  enough  away  from  the  diffraction  point  the  diffracted  field  is  ray  optical.  Thus, 


23-6 


Diffracted  Field 


<t> 

Figure  2:  Diffracted  field  in  the  vicinity  of  the  extensions, 
the  form  of  the  diffracted  field  at  the  extension  beyond  points  A  or  D  is  given  by 


Fd  =  F0—f(<j>),  (6) 

r 


where  r  is  the  distance  from  the  diffraction  point  and  is  an  arbitrary  function  of  the 

angular  cylindrical  coordinate  with  the  diffraction  point  as  the  origin,  as  shown  in  Figure  2. 
Applying  the  surface  boundary  condition  Js  =  n  x  H  yields  the  diffraction  currents 


e-A'r 

Jd  =  (vertical  polarization) 

Vr 

e~jkr 

—  J0 — — ,  (horizontal  polarization). 


(7) 


We  now  see  that  a  single  basis  function  of  the  form  of  equation  (7)  can  be  used  to  include  the 
diffraction  current  from  the  diffraction  point  to  infinity  in  the  hybrid  numerical  technique. 
This,  combined  with  the  known  physical  optics  currents,  entirely  describes  the  current  on  the 
infinite  extensions.  Since  there  are  no  discontinuities  on  the  modeled  surface,  no  artificial 
edge  effects  are  introduced. 

The  current  on  the  entire  surface  may  now  be  written  as 


Js  =  Jmm  +  Jd  +  Jpo-, 


(8) 
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where  Jmm  is  the  current  between  points  A  and  D  described  by  ordinary  MM  pulse  basis 
functions: 

N 

JmM  —  ^  ^?n)?  (9) 

771  =  1 

where  P(l  —  lm)  is  a  pulse  function  centered  at  lm  and  am  are  unknown  weighting  coefficients 
to  be  found  via  the  moment  method.  Jp>  includes  both  diffraction  current  terms: 

JD  =  aN+iJd\back+aN+2Jd\front,  (10) 

and  Jpo  is  the  physical  optics  current  on  the  front  and  back  faces  given  by 

Jpo=2nxHl|(„l+2,ixH'|/r„>„.  (11) 

Substituting  equation  (8)  into  equation  (3)  gives 

Fl  =  Lx[Jmm  +  Jd  +  Jpo ]■  (12) 

Because  the  Jpo  is  entirely  known  a  priori  and  L\[ }  is  a  linear  operator,  the  physical  optics 
term  may  be  moved  to  the  left  hand  side,  giving 

Fx  —  Lx[Jpo\  =  Lx[Jmm]  +  Lx[Jd]-  (13) 

Thus,  the  physical  optics  current  simply  appears  as  a  field  source  term  in  the  hybrid  technique. 
Evaluating  equation  (13)  at  the  centers  of  the  basis  functions  (point  matching  or  collation), 
plus  at  two  additional  points  on  the  extensions  yields  +  2  algebraically  linear  equations 
with  N  +  2  unknowns.  Solving  this  system  yields  the  moment  weighting  coefficients  am, 


E,  H  E,  H 


Figure  3:  Equivalent  problem  to  be  solved  with  lossy  dielectric  scatterer. 
completing  the  MM  solution  of  the  current.  The  far  field  scatter  is  then  determined  from 

Fs  =  -Lx[Jmm  +  Jd  +  Jpo]  r_fco-  (14) 

B.  Lossy  Dielectric  Surfaces 

When  the  scattering  surface  is  perfectly  conducting  a  true  surface  current  exists.  Thus,  the 
moment  method  solves  the  physical  scattering  problem  directly.  When  the  surface  is  not 
perfectly  conducting  a  surface  current  cannot  be  supported;  the  field  penetrates  the  surface 
and  a  volume  current  density  exists.  The  moment  method  is  not  well  suited  for  direct 
application  to  volume  current  problems.  Instead,  the  equivalence  principle  [7]  is  applied  as 
shown  in  Figure  3,  yielding  both  electric  (J)  and  magnetic  (M)  surface  current  densities  that 
radiate  the  desired  scattered  field.  Although  the  equivalent  problem  includes  only  surface 
currents,  the  moment  method  still  cannot  be  applied  directly  since  the  unknown  electric  and 
magnetic  currents  are  co-located.  Instead,  the  magnetic  current  is  expressed  in  terms  of  the 
electric  current  using  impedance  boundary  conditions  [8].  Assuming  that  the  conditions 

\N\  >  1,  |Im(Af)A,Y>(|»  1  (15) 
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where  N  is  the  complex  refractive  index  of  the  scattering  medium  and  pi  is  the  radius  of 
curvature  of  the  surface,  are  met  everywhere  on  the  surface,  the  field  penetrating  into  the 
surface  propagates  as  a  plane  wave  in  the  negative  surface  normal  direction.  The  two  surface 
current  components  can  then  be  related  by  [9] 

M  =  —Zsn  x  J,  (16) 

where  Zs  is  the  intrinsic  wave  impedance  of  the  lossy  dielectric. 

Applying  duality  to  equations  (1)  and  (2)  to  determine  the  near-held  radiation  of  the 
magnetic  current  density  and  using  equation  (16),  it  is  straightforward  to  show  that  the 
appropriate  two-dimensional  MFIE  for  determining  vertically  polarized  scattering  from  a 
lossy  dielectric  scatterer  is  [10] 

H\l)  =  LM[Js(l)\-—2LE[Js(l)l  (17) 

Vo 

Similarly,  with  a  lossy  dielectric  surface  the  EFIE  becomes 

E\l)  =  LE[Js(l )]  -  ZsLE[Js(l)].  (18) 

Since  equations  (17)  and  (18)  each  include  only  the  unknown  surface  current  Js  (and  not 
Ms )  they  are  well  suited  to  solution  using  moment  method  techniques. 

The  hybrid  MM/GTD  technique  can  be  extended  to  apply  to  equations  (17)  and  (18) 
to  find  the  scattering  from  lossy  dielectric  surfaces  of  the  type  shown  in  Figure  1  with  little 
modification.  The  surface  current  between  points  A  and  D  is  again  divided  into  pulse  basis 
functions  as  described  in  equation  (9),  and  the  diffraction-current  basis  functions  are  un¬ 
changed  from  equation  (7)  since  the  diffracted  held  is  still  ray  optical  at  suitable  distances 
from  the  diffraction  point  [11].  The  physical  optics  current  does  need  to  be  modified  slightly 
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since  the  surface  is  no  longer  perfectly  conducting: 


Jpo  =  (1  -  r)n  x  H'lj^-Hl  -  r)n  x  H*|/r„r  (19) 

where  T  is  the  appropriate  parallel  (vertical)  polarized  or  perpendicular  (horizontal)  polarized 
reflection  coefficient  on  the  front  and  back  extensions.  (Note  that  equation  (19)  reduces  to 
(11)  with  a  perfectly  conducting  surface.)  Substituting  equation  (8)  (with  the  modified  Jpo) 
into  equation  (17)  and  moving  the  known  terms  to  the  left  hand  (source)  side  yields 


H\l)  -  Lm{Jpo(1 )]  +  —2Le[Jpo{1))  (20) 

Vo 

=  Lm[Jmm{1)  +  Jd{1)\ - j Le[Jmm(1)  ~  Jd{1)]-  (21) 

Vo 

Similarly,  the  EFIE  becomes 

E\l)  -  Le[Jpo(1 )  +  Z.Lm[JP0(1 )]  (22) 

=  Le[Jmm{1)  +  Jd(1)]  —  ZsLm[Jmm{1 )  —  Jd{  0]-  (23) 


Both  equations  (20)  and  (22)  can  be  evaluated  at  the  N  +  2  matching  points,  and  the  resulting 
linear  system  algebraic  equations  solved  to  give  the  unknown  coefficients  cvn,  completing  the 
numerical  solutions.  The  far-field  scattering  from  the  surface  is  then  found  by  evaluating 

Hs  —  —Lm[Jmm  +  J d  +  Jpo\  +ZsLm[Jmm  +  J d  T  Jpo\  (24) 

L  r— >■  oo  r—f  oo 

or 

Z  1 

Es  =  —Le[Jmm  +  Jd  +  Jpo\  H — y  [Jmm  ~f~  Jd  +  Jpo]  j  (25) 

r—y  oo  lr  —roo 

1)  Implementation  considerations:  The  majority  of  the  Fellowship  was  spent  deriv¬ 
ing  efficient  implementations  of  equations  (20)  through  (25).  The  findings  are  summarized 
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here. 

Evaluation  of  Lx[Jpo\  proved  to  be  the  most  numerically  intensive  operation  in  the  hybrid 
technique.  It  requires  the  integration  to  infinity  of  an  integrand  that  oscillates  rapidly  and 
decays  slowly,  and  therefore  converges  quite  slowly.  Moreover,  these  terms  must  be  evaluated 
at  each  incidence  angle  examined  since  it  appears  as  a  source  term.  It  was  found  that  the 
convergence  can  be  dramatically  increased  by  evaluating  it  as  an  infinite  series  and  applying 
the  epsilon  convergence-acceleration  algorithm  [12]. 

Evaluation  of  L\[Jd ]  also  involves  the  infinite  integration  of  a  rapidly  oscillating,  slowly 
converging  integrand.  This  also  proved  well  suited  to  acceleration  via  the  epsilon  algorithm. 
Evaluation  of  Lx[Jmm\  is  unchanged  from  that  given  by  Axline  and  Fung  [1], 

The  radiation  of  Jmm  in  equations  (24)  and  (25)  was  accomplished  using  the  far-field 
approximations  given  by  Axline  and  Fung  [1],  Also  using  these  approximations  the  integ¬ 
rations  in  Lx[Jd{1)]  can  be  evaluated  in  terms  of  Fresnel  integrals  using  the  routines 
of  Press  et  al.  [13].  Evaluation  of  Lx[Jpo( 0]  required  more  consideration.  Using  the 
approximation  of  Axline  and  Fung  yields  an  integrand  that  does  not  decay  out  to  infinity, 
and  therefore  technically  has  no  solution.  Instead,  the  exact  integral  was  evaluated  at  a  very 
large  (but  finite)  observation  range  using  an  asymptotic  end-point  expansion  [14]. 

III.  APPLICATION 

The  numerical  routines  were  first  tested  by  finding  the  scattering  from  a  lossy  dielectric 
cylinder.  While  a  cylinder  is  not  a  surface  of  the  type  shown  in  Figure  1  and  does  not  require 
use  of  the  hybrid  MM/GTD  technique,  it  does  give  a  test  case  in  which  the  impedance 
boundary  moment-method  scattering  can  be  compared  with  an  exact  solution.  A  cylinder 
with  a  complex  dielectric  constant  of  70  -  j  50  was  chosen  to  approximate  the  properties  of  sea 
water  at  microwave  frequencies.  The  results  are  shown  in  Figure  4.  The  moment  calculations 
agree  with  the  exact  solution  to  within  1  dB  at  all  scattering  angles  at  both  horizontal 
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BISTATIC  SCATTERING  FROM  DIELECTIC  CYLINDER 

£r  =  70  -  j  50;  Radius  =  0.1  X 


Figure  4:  Comparison  of  moment  method  and  exact  calculation  of  scattering  from  a  lossy 
dielectric  cylinder. 
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Figure  5:  Rounded  wedge  scattering  surface. 


polarization  (the  electric  field  is  parallel  to  the  cylinder  axis)  and  vertical  polarization  (the 
electric  field  is  perpendicular  to  the  axis),  confirming  the  validity  of  the  approach  at  both 
polarizations. 

The  scattering  from  a  wedge  with  a  rounded  apex,  as  shown  in  Figure  5,  was  calculated 
using  the  full  implementation  of  the  hybrid  MM/GTD  technique.  The  radius  of  curvature  of 
the  apex  was  set  at  0.5A  and  the  interior  angle  of  the  wedge  was  120°.  The  scattering  was 
calculated  with  the  real  part  of  the  dielectric  constant  fixed  at  70  and  imaginary  part  ranging 
from  infinity  (perfectly  conducting  case)  down  to  10.  The  results  are  shown  in  Figures  6  and 
7.  The  incidence  angles  in  the  figures  are  referenced  to  vertical.  At  incidence  angles  ranging 
from  —30°  to  30°  the  bac.kscattering  is  dominated  by  specular  reflection  from  the  apex.  (The 
singularities  occuring  at  —30°  and  30°  are  due  to  specular  reflection  from  the  infinitely  long 
extensions).  Since  the  conditions  of  equation  (15)  are  met,  at  these  incidence  angles  the 
reduction  in  the  backscattered  field  with  decreasing  surface  conductivity  should  be  directly 
proportional  to  the  normal  incidence,  flat-surface  reflection  coefficient.  When  er  =  70  —  j’10 
the  magnitude  of  the  reflection  coefficient  is  0.788,  indicating  that  the  scattering  from  this 
surface  should  be  very  close  to  2.1  dB  below  that  from  the  perfectly  conducting  surface  in 
this  region.  The  actual  reductions  at  0°  incidence  agree  with  this  value  to  within  0.15  dB  at 
both  polarizations. 

At  incidence  angles  beyond  30°  (—30°)  the  scattering  is  no  longer  dominated  by  specular 
reflection,  but  instead  is  due  entirely  to  back  diffraction,  both  from  the  discontinuity  in  the 


23-14 


SCATTERING  FROM  ROUNDED  WEDGE 

Vertical  Polarization 


Figure  6:  Scattering  from  rounded-apex  wedge:  vertical  polarization. 


SCATTERING  FROM  ROUNDED  WEDGE 

Horizontal  Polarization 


Figure  7:  Scattering  from  rounded-apex  wedge:  horizontal  polarization. 
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radius  of  curvature  where  the  rounded  apex  begins  [15]  and  the  point  where  the  incidence 
vector  is  exactly  tangential  to  the  apex  [16].  Thus,  the  change  in  the  scattering  due  to  a 
reduction  of  the  surface  conductivity  cannot  be  easily  predicted.  However,  the  reductions 
observed  in  this  region  are  quite  realistic.  It  is  interesting  to  note  that  the  reduction  in  the 
region  is  significantly  greater  at  vertical  polarization.  This  most  likely  occurs  because  the 
transmission  into  the  surface  is  greater  at  this  polarization  when  the  local  incidence  angle  is 
not  normal. 


IV.  CONCLUSIONS 

A  hybrid  numerical  technique  combining  the  moment  method  and  the  geometrical  theory  of 
diffraction  has  been  enhanced  to  allow  the  calculation  of  the  electromagnetic  scattering  from 
a  lossy  dielectric  surface.  Sample  calculations  of  scattering  from  a  rounded  wedge  show  that 
the  expected  reduction  in  specular  scattering  from  a  lossy  surface  is  accurately  predicted. 
Realistic  reductions  were  also  predicted  in  the  diffractive  scattering  from  the  surface.  This 
technique  should  allow  more  accurate  prediction  of  the  scattering  from  land  and  sea  surfaces 
than  can  be  obtained  using  techniques  that  assume  a  perfectly  conducting  surface. 
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Abstract 


Future  developments  in  superconductor-semiconductor  microwave  devices  depend  on 
depositing  high-quality  high-temperature  superconductor  (HTS)  thin  film  on 
semiconductors.  Buffer  layer  technology  using  Y-stabilized  Zr02  (YSZ)  as  a  buffering 
layer  and  MgO  as  a  capping  layer  is  one  of  the  most  promising  processes  for  growing  HTS 
such  as  Yba2Cu307.g  (YBCO)  on  GaAs  substrate.  The  superconducting  properties  of  YBCO 
depend  on  the  quality  of  the  buffer  layer,  i.e.  c-oriented,  smooth,  and  free  of  defects,  which 
in  turn  depends  on  the  quality  of  the  capping  layer.  The  defect  structure  in  the  MgO 
capping  layer  deposited  by  sputtering  is  examined  in  the  present  study.  Defects  such  as  pin 
holes,  precipitates,  impurity  induced  structures,  bubbles,  and  steps  are  identified.  Films 
with  pin  holes  are  found  to  be  ineffective  as  a  chemical  barrier.  They  are  also  poor 
templates  for  growing  c-oriented  YSZ  films.  Other  defects  only  have  localized  effects  with 
no  observable  structural  effects  on  the  YSZ  buffer  layer.  Pin  holes  in  the  MgO  layer  can 
be  eliminated  by  reducing  the  target  to  sample  distance. 
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DEFECTS  IN  SPUTTER  DEPOSITED  MGO  FILMS  AS  A  GAAS  CAPPING  LAYER 
FOR  DEPOSITING  HIGH  TEMPERATURE  SUPERCONDUCTING  FILMS 


Peter  Ka-Chai  Wu 


L  Introduction 

The  advantages  of  using  high-temperature  superconductors  (HTS)  in  devices  is 
demonstrated  in  microwave  devices  such  as  Josephson  junction  and  dc  superconducting 
quantum  interference  devices  fabricated  using  HTS  such  as  Yba2Cu307.s  (YBCO).1'3  Most 
of  the  HTS  devices  are  fabricated  on  a  single  crystal  oxide  surface  which  serves  as  a 
template  as  well  as  a  chemical  buffer  layer  for  the  HTS.  There  is  however  no  reliable  and 
reproducible  methodology  to  grow  high-quality  HTS  thin  films  on  semiconductors  which  is 
essential  for  fabricating  integrated  superconductor-semiconductor  devices.4  Integration  of 
HTS  devices  with  semiconductor  devices  can  enhance  the  performance,  improve  the 
reliability,  and  reduce  the  cost. 

The  most  widely  used  technique  to  grow  high  quality  YBCO  films  on 
semiconductors  is  to  use  a  buffer  layer.5  6  Materials  such  as  MgO,  Ce02,  Y-stabilized  Zr02 
(YSZ),  and  SrTi03  are  good  buffer  layers  for  YBCO  grown  on  GaAs  substrates.  GaAs  is 
an  attractive  substrate  because  of  its  low  dielectric  constant  or  low  loss  tangent  for 
microwave  applications  and  the  low  cost.  High  quality  YBCO  thin  films  with  a  critical 
temperature  (Tc)  of  >  90  K  and  a  critical  current  (Jc)  of  5  X  106  A/cm2  at  77  K  are 
routinely  grown  on  single  crystal  oxide  substrates.  Tc  and  Jc  of  these  YBCO  thin  films 
were  found  to  be  very  sensitive  to  materials  properties  such  as  grain  size  and  orientation. 
Films  with  the  highest  Jc  are  c-axis  oriented  because  the  most  favorable  transport  properties 
are  on  the  a-b  plane.  Jc  thus  increases  as  c-axis  alignment  is  enhanced.7  Jc  is  also  very 
sensitive  to  structures  such  as  grain  size  and  grain  boundary.8  9 

The  difficulties  in  obtaining  high-quality  YBCO  thin  films  on  GaAs  are  as  follows: 
(1)  GaAs  is  not  stable  under  the  oxidizing  environment  needed  to  grow  HTS  thin  films;  (2) 
without  a  capping  layer,  GaAs  decomposes  at  temperatures  above  600°C,  which  is  required 
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to  grow  the  HTS  film;  (3)  GaAs  reacts  with  the  HTS  and  destroys  its  superconducting 
property;  (4)  crystalline  YBCO  film  will  not  form  because  of  the  large  lattice  mismatch 
between  GaAs  and  YBCO;  and  (5)  there  is  a  significant  difference  in  the  thermal  expansion 
coefficients.  A  buffer  layer  such  as  MgO11  or  YSZ12  can  circumvent  most  of  these 
problems. 

The  buffer  layer  must  be  a  good  template  to  grow  c-oriented  YBCO.  It  must  also 
remain  smooth  and  free  of  defects,  because  the  Jc  of  the  YBCO  film  subsequently  deposited 
is  very  sensitive  to  structures  such  as  grain  size  and  grain  boundary.14-15  For  small 
misorientation  angles,  the  ratio  of  the  grain-boundary  critical  current  density  to  the  bulk 
critical  current  density  is  roughly  proportional  to  the  inverse  of  the  misorientation  angle; 
for  large  angles,  this  ratio  saturates  to  a  value  of  about  1/50. 16  A  textured,  e.g.,  c-axis 
oriented,  film  with  in-plane  rotation  or  mosaic  can  reduce  superconducting  properties. 

YSZ  is  superior  to  MgO  as  a  buffer  layer  because  of  a  better  match  in  lattice 
constant  and  thermal  coefficient  of  expansion  with  YBCO.  Tc  as  high  as  92  K  has  been 
reported  for  YBCO  thin  films  grown  on  YSZ  substrates.13  It  is  thus  important  to  develop  a 
method  of  depositing  high  quality  YSZ,  i.e.,  c-oriented,  smooth,  free  of  structural  defects, 
on  GaAs  substrates.  To  deposit  high  quality  YSZ  films,  the  minimum  substrate 
temperature  (Ts)  required  is  600°C.  This  is  not  acceptable  as  GaAs  will  decompose  at  these 
temperatures.  This  problem  can  be  circumvented  by  first  depositing  a  layer  of  MgO,  -100 
nm  thick,  to  cap  the  GaAs  before  depositing  the  YSZ.  MgO  is  deposited  as  Ts  is  raised 
from  room  temperature  to  650°C,  the  YSZ  deposition  temperature.  The  MgO  will  prevent 
the  GaAs  from  decomposing.  The  quality  of  the  YSZ  subsequently  deposited  is  dependent 
on  the  quality  of  the  MgO  capping  layer. 

MgO  can  be  deposited  using  direct  sputtering  of  an  MgO  target  or  reactive 
sputtering  of  metallic  Mg.  Both  methods  require  an  abient  with  02.  The  goal  of  this  work 
is  to  identify  the  defect  structure  of  the  MgO  deposited.  The  reaction  or  interdiffusion  of 
chemical  species  between  the  GaAs  and  MgO  is  examined.  The  defects  formed  at  the  MgO 
capping  layer  can  propagate  through  the  YSZ  layer  and  ultimately  affect  the  quality  of  the 
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YBCO  thin  film.  The  effects  of  the  defects  on  the  quality  of  the  YSZ  buffer  are  examined. 
The  optimal  deposition  parameter  necessary  to  eliminate  defects  was  tested  and  identified. 

II.  Experimental 

GaAs  substrates  used  are  high  resistance  single  crystal  wafers.  Wafers  are  dipped  in 
HjSCVHjCVHjO  =  5:1:1  (by  volume)  solution  before  introduction  into  the  deposition 
chamber.  No  other  in  situ  treatment  is  done  prior  to  deposition. 

MgO  films  are  deposited  using  RF  sputtering  of  either  a  MgO  target  or  metallic  Mg 
target.  Sputtering  power  of  100  W  is  used  in  both  cases.  Ts  at  the  beginning  of  deposition 
is  25 °C.  It  is  then  raised  to  a  final  temperature  of  600°C  in  1200  sec  and  held  constant 
throughout  the  deposition.  In  the  case  of  the  MgO  target,  a  gas  mixture  of  2.7  Pa  of  Ar 
and  0.13  Pa  of  02  is  used.  In  the  case  of  the  metal  Mg  target,  a  gas  mixture  of  1.3  Pa  of 
Ar  and  0.13  Pa  of  02  is  used.  The  deposition  rates  for  using  the  MgO  target  and  the  Mg 
target  are  0.38  and  0.55  nm/min  respectively. 

YSZ  is  deposited  using  RF  sputtering  of  an  YSZ  target.  The  sputtering  power  is 
120  W.  The  gas  mixture  is  composed  of  3.3  Pa  of  Ar  and  3.3  Pa  of  02.  Ts  is  kept  at 
700 °C  throughout  the  deposition. 

Film  orientation  is  identified  using  X-ray  Diffraction  (XRD).  Optical  and  Scanning 
Electron  Microscopy  (SEM),  and  Atomic  Force  Microscopy  (AFM)  are  used  to  identify 
surface  structures,  grain  shape  and  size,  as  well  as  roughness.  Energy-dispersive  X-ray 
Spectrometry  (EDX),  X-ray  Photoelectron  Spectroscopy  (XPS),  and  Auger  Electron 
Spectroscopy  (AES)  are  used  to  determine  composition  and  chemical  states. 

IIL_  Results  and  Discussion 

Several  types  of  defects  are  identified  on  the  MgO  film.  These  are  pin  holes, 
precipitates,  defect  induced  structures,  bubbles,  and  steps.  Pin  holes  when  they  are  present 
are  distributed  throughout  the  entire  surface.  The  other  defects  are  found  to  be  localized. 

Pin  holes  can  be  eliminated  by  reducing  the  target  to  sample  distance. 
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III. a.  Pin  holes 


One  of  the  most  important 
defects  we  found  on  the  MgO  films  is 
pin  holes.  The  SEM  micrograph  of  a 
MgO  surface  with  pin  holes  deposited 
using  an  MgO  target  shows  that  the 
pin  holes  are  evenly  distributed  on 
the  entire  surface  of  the  MgO  films. 
An  AFM  micrograph,  Figure  1, 
shows  that  the  average  grain  size 
between  pin  holes  is  500  nm  with  a 
mean  surface  roughness  of  28.9  nm. 
Because  the  height  range,  220  nm,  is 
on  the  order  of  the  thickness  of  the 


film  and  the  AFM  cantilever  is 
incapable  of  reaching  into  tight  areas, 
It  is  likely  that  the  pin  holes  extend 
all  the  way  to  the  substrate  surface. 

Figure  2  shows  an  expanded 
view  of  one  of  the  pin  holes.  The 
large  grains  observed  in  Figure  2  are 
composed  of  several  smaller  grains. 
The  pin  holes  are  formed  where  three 
or  more  of  these  grains  touched. 

This  type  of  microstructure  indicates 
a  coalescent  process  where  smaller 
grains  combined  to  form  a  larger 


Figure  1.  AFM  micrograph  of  a  MgO  film  with  pin 


three  dimensional  grain.  Above  a  Figure  2.  AFM  micrograph  of  a  pin  holesin  the  MgO 

film,  (height  range  =  150  nm,  size  1  pm  X  1  pm) 
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certain  size,  larger  grains  are  less 
likely  to  coalesce  because  of  the  larger 
energy  barrier  for  such  a  process. 

When  neighboring  grains  grow  and 
touch  each  other  a  pin  hole  is  formed. 

The  micrograph  from  the 
surface  of  a  MgO  film  without  pin 
hole,  Figure  3,  shows  an  average  grain 
size  of  50  nm  and  a  mean  surface 
roughness  of  3.7  nm.  The  average 
grain  is  much  smaller  than  that  shown 
in  Figure  1  and  is  mainly  spherical.  A 
large  range  of  grain  size  is  present. 

Figure  3.  AFM  micrograph  of  a  MgO  film  without  pin 
This  means  that  no  substantial  amount  holes,  (height  range  =  20  nm,  size  =  1  pm  X  1  pm) 

of  coalescing  is  present.  These  results  indicate  that  pin  holes  are  formed  when  the 

coalescent  rate  is  higher  than  the  nucleation  and  growth  rate. 

The  result  of  an  AES  depth  profile  of  a  30  nm  thick  MgO  film  with  pin  holes  is 
shown  in  Figure  4.  Figure  4  shows  the  profile  of  the  entire  film  plus  part  of  the  GaAs 
substrate.  The  MgO  layer  and  the  GaAs  substrate  is  clearly  identified.  The  MgO/GaAs 
interface  is  not  a  sharp  interface.  An  interfacial  region  containing  Ga  is  shown  extending 
into  the  MgO.  Thus  Ga  or  oxides  of  Ga  are  reacting  with  or  diffusing  into  the  MgO.  In 
contrast  no  substantial  amount  of  As  is  found  in  the  MgO  matrix. 

XRD  measurements  on  the  MgO  film  with  pin  holes  shows  a  polycrystalline 
structure.  YSZ  deposited  on  these  MgO  films  shows  no  preferred  (001)  orientation.  This 
is  not  surprising  as  the  YSZ  is  grown  on  a  very  rough  surface  and  the  orientation  of  the 
YSZ  grain  orientation  depends  on  its  nucleation  sites  on  the  MgO  grain. 
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To  determine  the  chemical  state  of  the  Ga  in  the  interfacial  region,  an  4.8  nm  thick 
MgO  film  is  deposited  and  examined  with  XPS.  This  thickness  is  chosen  so  that  the 
photoelectron  from  the  interfacial  region  can  escape  from  the  surface.  Possible  new 
chemical  states  promoted  by  the  sputtering  process  are  eliminated.  Figure  5  shows  the 
Ga(2p  3/2),  Mg(2p),  and  O(ls)  spectra  from  such  a  film.  The  O(ls)  spectrum  is  composed 
mainly  of  two  peaks  indicating  two  major  types  of  oxide  in  the  sample.  The  main  peak  in 
the  Mg(2p)  spectrum  has  the  correct  energy  shift  for  MgO.  The  Ga(2p  3/2)  spectrum 
shows  one  peak  indicating  one  major  chemical  state,  which  has  the  correct  chemical  shift 
for  Ga203.  Both  the  fitted  Mg(2p)  and  Ga(2p  3/2)  peaks  are  less  than  1  eV  wide  indicating 
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Figure  5.  XPS  spectra  from  an  MgO  film  4.8  nm  thick.  Dashed  lines  are  fitted  results. 


X-axis  is  Binding  Energy  (eV);  top  spectrum,  O(ls);  middle  spectrum  Mg(2p);  and 


bottom  spectrum  Ga(2p  3/2) 
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one  major  chemical  state  contributes 
to  each  peak.  The  XPS  result  thus 
shows  the  interfacial  region 
composed  mainly  of  Ga203  and  MgO 
and  not  a  reaction  product  of  the 
MgO  and  the  substrate. 

An  AFM  micrograph,  Figure 
6,  of  an  YSZ  film  grown  on  MgO  an 
film  with  pin  holes  shows  the  YSZ 
surface  to  be  smooth  as  compared  to 
the  MgO  underneath.  The  average 
grain  size  is  200  nm  with  a  mean 
surface  roughness  of  2.3  nm.  Thus 


Figure  6.  AFM  micrograph  of  a  YSZ  film  grown  on  an 


MgO  film  with  pin  holes. 


Sputter  Time  (min) 


Figure  7.  AES  depth  profile  of  a  YSZ  film  grown  on  an  MgO  film  with  pin  holes. 
X  -  O,  □  -  Mg,  ☆  -  As,  0  -  Y,  I  -  Zr,  and  a  -  Ga. 
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even  though  the  substrate  is  rough,  the  YSZ  will  fill  in  the  pin  holes  and  result  in  a 
smoother  film. 

The  AES  depth  profile  from  a  sample  with  YSZ  deposited  on  a  MgO  film  with  pin 
holes  is  shown  in  Figure  7.  Figure  7  represents  both  films  and  part  of  the  GaAs  substrate. 
Both  the  YSZ  and  the  MgO  layers  and  the  GaAs  substrate  were  clearly  identified.  The 
interfacial  region  found  in  Figure  4  is  also  identified  here.  The  MgO  and  interfacial  region 
thickness  are  estimated  to  be  40  nm.  Even  though  the  Ga203  diffuses  almost  through  the 
MgO  layer,  no  mixing  or  reaction  is  evident  in  the  YSZ  layer. 

Figure  8  shows  an  AES  depth  profile  from  an  MgO  film  without  pin  holes.  This 
profile  represents  all  of  the  MgO  films  and  part  of  the  GaAs  substrate.  The  interfacial 
region  observed  in  Figure  4  and  7  is  not  present  here  indicating  no  chemical  reaction 
between  the  MgO  and  the  GaAs  substrate.  This  result  agrees  with  the  XPS  result  in  that 
the  Ga  present  in  the  interfacial  region  is  a  result  of  Ga203  diffusing  into  the  MgO  instead 


Figure  8.  AES  depth  profile  of  an  MgO  film  without  pin  holes. 
X  -  0,  a  -  Ga,  □  -  Mg,  and  ☆  -  As 
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of  a  chemical  reaction  between  the 
overlayer  and  the  substrate. 
Furthermore,  the  dominant  diffusion 
path  is  through  the  pin  holes. 

Surface  or  grain  boundary  diffusion 
should  not  be  the  main  diffusion 
mechanism.  Because  of  the  smaller 
grain  size,  the  total  surface  area  of 
MgO  grains  is  much  larger  in  films 
without  pin  holes  than  that  with  pin 
holes.  If  surface  diffusion  is  the 
dominant  transport  mechanism,  one 
would  expect  a  thicker  interfacial 
region  than  that  shown  in  Figure  8. 


1 50  nm,  size  =  2  pm  X  2  pm) 


This  is  evidently  not  the  case.  The  main  diffusion  venue  is  thus  through  the  pin  holes. 


IH.b.  Precipitates 

Precipitates  are  found  on  some  films.  These  are  small  particles  distributed  evenly 
across  the  surface.  The  majority  of  the  precipitates  are  circular  in  shape  with  an  average 
diameter  of  350  nm  and  50  nm  in  height,  see  the  round  particle  in  Figure  9.  Some  larger 
precipitates  are  also  found,  see  the  larger  elongated  particle  in  Figure  9.  These  precipitates 
are  found  on  reactively  sputtered  MgO  films.  It  is  thus  possible  that  they  are  a  result  of 
segregation  from  the  matrix  due  to  composition  differences.  However,  no  chemical 
identification  is  available.  Work  is  required  to  determine  deposition  conditions  which  will 
eliminate  these  precipitates. 
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III.c.  Impurity  induced  growth 
morphology 

Figures  10  and  11  show  a 
special  structure  found  on  some  of 
the  MgO  films.  Figure  10  shows  the 
morphological  structure  surrounding 
a  particle  in  the  center.  Figure  1 1  is 
an  enlarged  view  of  the  structure  near 
the  particle  in  the  center.  The  extent 
of  this  type  of  structure  is  20  to  30 
pm  in  diameter.  The  morphology  of 
the  MgO  in  this  area  is  very  different 
from  that  of  the  normal  MgO  film. 

Figure  10  .  AFM  micrograph  of  the  structure  shown  in 
see  Figure  1.  Several  structures  like  Figure  11  enlarged  5X.  (height  range  =  1pm,  size  =  6 

this  were  found  located  randomly  on  X  6  pip) 

the  surface.  The  composition  or  the 

origin  of  the  particles  in  the  center  of 

this  structure  are  not  known.  These 

structures  were  only  found  on  a  few 

films  and  their  appearance  does  not 

show  any  trend  in  any  deposition 

condition.  It  is  thus  likely  that  these 

particles  are  contaminants  due  to 

careless  substrate  preparation. 


Ill.d.  Bubbles 

Bubbles  are  found  on  MgO 
films  deposited  using  reactive 


Figure  11.  AFM  micrograph  of  Impurities  induced 
structures,  see  text,  (height  range  =  1  pm,  size  =  20  pm 
X  20  pm) 
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sputtering  of  metal  targets.  Figure  12 
shows  one  of  these  bubbles.  This 
particular  bubble  has  a  diameter  of  6 
fj. m  and  a  height  of  220  nm.  Some  of 
the  bubbles  will  burst.  Figure  13 
shows  one  of  these  burst  bubbles. 

This  bubble  has  a  diameter  of  8  /un. 
We  have  used  AFM  to  probe  the 
inside  of  the  burst  bubble.  It  is  found 
that  the  bottom  of  the  bubble  is  flat. 
The  bottom  of  the  burst  bubble  is 
found  to  be  280  nm  deep.  The 
presence  of  trapped  bubbles  in  the 
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Figure  12.  AFM  micrograph  of  a  bubbie  in  a  MgO  film. 
MgO  film  is  not  acceptable.  This  is  (height  range  =  500  nm,  size  =  10  pm  X  10  pm ) 


because  the  intended  application  of 
these  films  is  in  superconducting 
devices.  Bubbles,  especially  those 
with  trapped  gas,  can  be  detrimental 
to  devices  during  thermal  cycling. 

Bubbles  can  be  a  result  of 
trapped  gas  during  sputter  deposition 
or  internal  chemical  segregation 
leading  to  release  of  gas.  Thus 
varying  gas  pressures  arid  mixtures  as 
well  as  changing  deposition  rates 
should  make  a  difference.  Work  is 
being  done  to  eliminate  these  types  of 
defects . 


Figure  13.  AFM  micrograph  of  a  burst  bubble  on  a 
MgO  film,  (height  range  =  900  nm,  size  =  15  pm  X  15 


pm) 
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Hl.d.  Steps 

Steps  on  the  MgO  surface  are 
found  on  the  edges  of  the  samples. 

Figure  14  shows  some  of  these  steps. 

Cross  section  analysis  of  these 
structures  shows  flat  terraces  between 
step  edges.  Terrace  height 
determined  by  cross-sectional  study 
ranges  from  13  to  26  nm  .  There  are 
holes  on  some  of  the  terraces.  The 
holes  also  have  flat  bottom,  the  depth 
of  these  holes  has  the  same  range  as 
the  terrace  height.  Some  step  edges 

Figure  14.  AFM  micrograph  of  steps  on  a  MgO  film, 
delaminated  from  the  next  terrace  (Height  range  =  300  nm,  size  =  5  pm  X  5  pm) 

down  and  curled  up.  This  result  and  the  result  from  the  burst  bubble,  Figure  13,  indicates 

that  the  MgO  is  grown  in  a  layer-by-layer  fashion.  Because  steps  are  only  found  at  the 

edges  of  the  sample,  the  cause  of  the  steps  can  be  a  result  of  structural  defects  or  the 

temperature  gradient  during  deposition  at  the  edge  of  the  sample. 

IV.  Conclusion 

Several  types  of  defects  of  sputter  deposited  MgO  films  on  GaAs  are  identified. 

They  are  pin  holes,  precipitates,  impurity  induced  structures,  bubbles,  and  steps.  MgO 
films  with  pin  holes  are  poor  chemical  buffers  because  of  the  diffusion  of  Ga203  through  the 
MgO  film.  These  films  are  also  bad  templates  for  growing  a  c-oriented  YSZ  buffer  layer. 
YSZ  deposited  on  the  MgO  surface  with  pin  holes  will  fill  the  holes  and  grow  into  a 
comparatively  smooth  surface.  Pin  holes  can  be  eliminated  by  reducing  the  sample  to  target 
distance  during  sputter  deposition.  All  the  other  defects  were  found  to  be  localized  and 
have  no  detectable  long  range  effects  on  the  YSZ  films  subsequently  deposited.  High 
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quality  YSZ  films  can  be  deposited  on  GaAs  using  the  MgO  as  a  capping  layer.  Good 
MgO  films  can  be  deposited  using  deposition  condition  stated  earlier  and  with  a  sample-to- 
target  distance  of  less  than  5  cm.  '  ■ 
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